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OurRef: 36541-0005 
October 6, 2000 
DELIVERED 

THE COMMISSIONER OF PATENTS AND TRADEMARKS 
WASHINGTON, D,C. 20231 

Dear Sir: 



We enclose the documents identified below in respect of a patent application in 
the name of Harold A. Robertson and Eileen M. Donovan-Wriqht for an 
T^nrS '^ GENE NECE5SARY F °R STRIATAL FUNCTION, USES 
receipf COMPOUNDS FOR MODULATING SAME" to be fried on 

Enclosed are: Specification, Claims & Abstract 
Declaration, Power of Attorney 
41 Sheets of Drawings 
Small Entity Declaration 
Cover Letter 
Assignment 

Assignment Recordation Form Cover Sheet 
Sequence listings on diskette 
Response Card 





CLAIMS FILED 


BASIC FEE 


Number Excess 
Rate 


S 355.00 


Total claims 
Independent claims 
Multiple dependencies 


19-20= x 
4-3=1 x 40 


S 40.00 




Total filing fees 


$ 395.00 j 
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October 6, 2000 




The Commissioner of Patents and Trademarks 



Please deduct the amount of $395.00 U.S. from our Deposit Account No 13- 
2400. Please also deduct the amount of $40.00 U.S. from our Deposit Account 
to cover the assignment recordation fee. Please deduct any additional fees 
required from our deposit account. An additional signed copy of this letter is 
attached. 

Should any Patent and Trademark Office Official want to telephone, the call 
should be made to Mr. David J. Heller (Registration No. 43,384) at (416) 868- 
1482. 



Yours very truly, 




David J. Heller 
(Registration No. 43,384) 



OJH;lc 
Ends. 




Applicant or Patentee: NovaNeuron Inc. Attorney's 

Serial or Patent No.: N/A Docket No: 

Filed or Issued: N/A 36541-0004 

Title: GENE NECESSARY FOR STRIATAL FUNCTION, USES THEREOF, AND 
COMPOUNDS FOR MODULATING SAME 



VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS - 
(37 CFR 1.9(f) & 1.27(c)).. SMALL BUSINESS CONCERN 

I hereby declare that I am 

the owner of the small business concern identified below: 

X an official of the small business concern empowered to act on 
behalf of the concern identified below: 



NAME OF SMALL BUSINESS CONCERN NovaNeuron Inc. 

ADDRESS OF SMALL BUSINESS CONCERN Room 15 D 7 , Tupper Building 

Halifax, N.S. B3H 4H7 

Canada 

I hereby declare that the above identified small business 
concern qualifies as a small business concern as defined in 13 CFR 
121.12, and reproduced in 37 CFR 1.9(d), for purposes of paying 
reduced fees to the United States Patent and Trademark Office, in 
that the number of employees of the concern, including those of its 
affiliates, does not exceed 500 persons. For purposes of this 
statement (1) the number of employees of the business concern is 
the average over the previous fiscal year of the concern of the 
persons employed on a full-time, part-time or temporary basis 
during each of the pay periods of the fiscal year, and (2) concerns 
are affiliates of each other when either, directly or indirectly, 
one concern controls or has the power to control the other, or a 
third party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been 
conveyed to and remain with the small business concern identified 
above with regard to the invention, entitled GENE NECESSARY FOR 
STRIATAL FUNCTION. USES THEREOF. AND COMPOUNDS FOR MODULATING SAME 

by inventor (s) Harold A. Robertson et al . 

described in 

x the specification filed herewith 

application serial no. filed 

patent no. issued 
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If the rights held by the above identified small business 
concern are not exclusive, each individual, concern or organization 
having rights in the invention is listed below" and no rights to 
the invention are held by any person, other than the inventor (s), 
who would not qualify as an independent inventor under 37 CFR 
1.9(c) if that person made the invention, or by any concern which 
would not qualify as a small business concern under 37 CFR 1.9(d), 
or a nonprofit organization under 37 CFR 1.9(e). "NOTE: Separate 
verified statements are required from each named person, concern or 
organization having rights to the invention averring to their 
status as small entities (37 CFR 1.27). 



( ) INDIVIDUAL ( ) SMALL BUSINESS CONCERN ( ) NONPROFIT ORGANIZATION 



( ) INDIVIDUAL ( ) SMALL BUSINESS CONCERN ( ) NONPROFIT ORGANIZATION 



I acknowledge the duty to file, in this application or 
patent, notification of any change in status resulting in loss of 
entitlement to small entity status prior to paying, or at the time 
of paying, the earliest of the issue fee or any maintenance fee due 
after the date on which status as a small entity is no longer 
appropriate (37 CFR 1.28(b)). 

I hereby declare that all statements made herein of my own 
knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under 
section 1001 of Title 18 of the United States Code, and that such 
willful false statements may jeopardize the validity of the 
application, any patent issuing thereon, or any patent to which 
this verified statement is directed. 

NAME OF PERSON SIGNING Nl^'l &\Clu^xL 

TITLE OF PERSON IF OTHER THAN OWNER 9y-L^' Am^Jt . MGv/itolU^^ \ka C - 
ADDRESS OF PERSON SIGNING ivlCv?x<Og^-ryw 



NAME 



ADDRESS 



NAME 



ADDRESS 
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Gene Necessary for Striatal Function. Uses Thereof, and 



Compounds for Modulating Same 



CROSS-REFERENCE 

This patent claims priority from US provisional application no. 60/158,043 filed October 7, 
1999 and US provisional application no. 60/217,765 filed July 12, 2000, entitled Gene 
Necessary for Striatal Function, Uses Thereof, and Compounds for ModulatingSame. 

FIELD OF THE INVENTION 

The present invention relates to a polynucleotide, PDE10A, which is down-regulated during 
the development of CAG repeat disorders, such as Huntington's disease. The present 
invention also describes compounds that modulate CAG repeat disorders, processes for 
expressing PDE10A, and its agonists and antagonists, and uses of PDE10A, and its variants, 
derivatives, agonists and antagonists. 

BACKGROUND OF THE INVENTION 

Very few if any effective treatments exist for neurological disorders characterized by 
progressive cell loss, known as neurodegenerative diseases, as well as those involving acute 
cell loss, such as stroke and trauma. 

Huntington's disease (HD) is an inherited neurological disorder that is transmitted in 



1 



autosomal dominant fashion. HD results from genetically programmed degeneration of 
neurons in certain areas of the brain. Huntington's disease is caused by a mutation of the 
gene IT- 15 that codes for the protein huntingtin. The huntingtin gene contains a polymorphic 
stretch of repeated CAG trinucleotides that encode a polyglutamine tract within huntingtin. If 
this tract exceeds 35 in number, Huntington's disease results. Huntington's disease is only 
one of a number of neurological diseases which are characterised by these polyglutamine 
repeats (Ross, 1997). Schizophrenia, Alzheimer's disease, stroke, trauma, and Parkinson's 
disease also affect the basal ganglia. 

Huntingtin has no sequence similarity to known proteins (Group THDCR, 1993; Sisodia, 
1998). The function of the normal or mutated HD form of huntingtin has not been defined by 
the prior art. It is evident, however, that the expression of the HD form of huntingtin leads to 
progressive and selective neuronal loss. It has been demonstrated that the GABA- and 
enkephalin-containing medium spiny projection neurons of the caudate-putamen eventually 
die as a result of HD (Richfield et al., 1994). Patients with minimal cell loss, however, still 
present with motor and cognitive symptoms suggesting that neuronal dysfunction, and not 
simply cell loss, contribute to the symptoms of HD. The motor symptoms of HD include the 
development of chorea, dystonia, bradykinesia and tremors (Young et al., 1986). Voluntary 
movements may also be affected such that there may be disturbances in speech (Ludlow et 
al., 1987) and degradation of fine motor co-ordination (Young et al., 1986). In addition to 
motor decline, emotional disturbances and cognitive loss are also evident during the 
progression of HD (Caine et al, 1978). 

Despite the fact that huntingtin is ubiquitously expressed, HD specifically affects cells of the 
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basal ganglia, structures deep within the brain that have a number of important functions, 
including co-ordinating movement. The basal ganglia includes the caudate nucleus, the 
putamen, the nucleus accumbens and the olfactory tubercule. HD also affects the brain's 
outer surface, or cortex, which controls thought, perception, and memory. The mechanism by 
which only a small group of neurons in the striatum and cortex are rendered vulnerable to this 
ubiquitously expressed mutant protein is not known. There are no effective treatments for 
Huntington's disease. 

Huntington's disease is widely believed to be a gain-of function disorder but neither the 
normal function nor the gained function of huntingtin is known. Because the function for 
huntingtin is not known, there is little insight into the disease process. It was believed that 
huntingtin was related to neuronal intranuclear inclusions (Nil). However, recent results have 
cast doubt on our understanding of the role of the Nil in Huntington's disease (Saudou et al., 
1998) or in other CAG repeat disorders (Klement et al., 1998; see also commentary by 
Sisodia, 1998). 

The development of a mouse carrying the 5' end of the human Huntington's disease gene (the 
promoter and first exon; Mangiarini et al., 1996) was an important step in the development 
of the tools that will allow us to understand the function (and gain-of-function) associated 
with huntingtin. R6/2 mice exhibit a rapidly progressing neurological phenotype with onset 
at about 8 weeks. This phenotype includes a movement disorder characterised by shuddering, 
resting tremor, epileptic seizures and stereotyped behaviour. These symptoms suggest that 
the function of the basal ganglia is affected by the expression of the human exon 1 transgene 
prior to neuronal cell death. By 12 weeks the affected mice have significantly reduced brain 
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weights and they die by about 13 weeks of age. Neuronal intranuclear inclusions (Nil) 
develop at about 4 weeks (Davies et al., 1997). As is observed in human Huntington's 
disease patient, the R6/2 mice show changes in neuronal receptors (Cha et al., 1998). The 
present inventors have also demonstrated that changes in the expression of DARPP-32 and 
cannabinoid receptors change over time in HD mice; such changes have also been observed 
in human Huntington's disease patients (unpublished results). The loss of the cannabinoid 
receptor is one of the earliest documented changes that occur prior to neuronal degeneration 
in human HD patients. The R6/2 model, therefore, mimics the early phases of HD; a point in 
disease development where intervention would be most appropriate. 

Human PDE10 was recently identified by identification of cDNA fragments published on the 
National Center for Biotechnology Information (NCBI) Expressed Sequence Tags (EST) 
database (Loughney et al., W099/42596). While PDE10 was found to share homology with 
known PDEs, no function could be identified for PDE10. 

SUMMARY OF THE INVENTION 

The present invention provides the function and uses of a nucleotide segment, PDE10A, and 
compounds which inhibit or promote the development of CAG repeat disorders such as 
Huntington's Disease. 

The invention teaches a method for identifying a compound which inhibits or promotes a 
CAG repeat disorder, comprising the steps of: (a) selecting a control animal having PDE10A 
and a test animal having PDE10A; (b) treating said test animal using a compound; and (c) 
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determining the relative quantity of RNA corresponding to PDE10A, as between said 
animals. In an embodiment, the animal is a mammal, preferably a mouse, and preferably a 
transgenic mouse. In an embodiment, the CAG repeat disorder is Huntington's disease. 

The invention also teaches a method for identifying a compound which inhibits or promotes a 
CAG repeat disorder, comprising the steps of: (a) selecting a host cell containing PDE10A; 
(b) cloning said host cell and separating said clones into a test group and a control group; (c) 
treating said test group using a compound; and (c) determining the relative quantity of RNA 
corresponding to PDE10A, as between said test group and said control group. In an 
embodiment, the CAG repeat disorder is Huntington's disease. 

The invention further teaches a method for detecting the presence of or the predisposition for 
a CAG repeat disorder, said method comprising determining the level of expression of RNA 
corresponding to PDE10A in an individual relative to a predetermined control level of 
expression, wherein a decreased expression of said RNA as compared to said control is 
indicative of a CAG repeat disorder. Preferably, the expression is measured by in situ 
hybridization, fluorescent in situ hybridization, polymerase chain reaction, or DNA 
fingerprinting technique. In an embodiment, the CAG repeat disorder is Huntington's 
disease. 

The invention further teaches compositions for treating a CAG repeat disorder comprising a 
compound which modulates PDE10 expression and a pharmaceutically acceptable carrier. 
The compound can be selected from the group consisting of: quinpirole, alloxan, miconazole 
nitrate, MDL-12330A and tetracyline derivatives such as demeclocycline. The compound 
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may be selected from the group consisting of: (6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5- 
benzofuranyl)-2-methyl-pyrazino[2', r:6,l]pyrido[3,4-b]indole-l,4-dione, 
(6RJ2aR)-2,3,6J42J2a-Hexahydro-6-(5-benzofuranyl)-pyrazino[2 , ,l':6,l]py rido[3,4- 
]indole- 1 ,4-dione, (6R, 1 2aR)-2,3,6,7, 12,1 2a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl- 
pyrazino[ 2',V:6,l ]pyrido[3,4-b]indole- 1 ,4-dione, (3 S,6R, 1 2aR)-2,3 ,6,7, 1 2, 1 2a-Hexahydro-6- 
(5 -benzofuranyl)-3 -methyl-pyrazino [ 2' , 1' : 6, 1 Jpyrido [3 ,4-b]indole- 1 ,4-dione, and 
(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2,3-dimethyl-pyraz 
ino[2',r:6,l]pyrido[3,4-b]indole-l,4-dione, or from the group consisting of: KS-505, 
IC224,SCH 51866, IBMX and Dipyridamole. The disorder can be HD. 

The invention also teaches the use of a composition which modulates PDE10 for treating a 
CAG repeat disorder comprising administering the composition to a subject in need of such 
treatment, and such use of the composition which modulates PDE10 for treating HD. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a portion of an autoradiogram of the differential display reaction identifying 
PDE10A in mouse brain mRNA. 

FIG. 2 is a northern blot confirming that PDE10A has a lower steady-state level of expression 
in the striatum of transgenic HD mice. 

FIG. 3 is a nucleotide sequence of the differential display cDNA fragment of pPDElOA. 
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FIG. 4 shows the in situ hybridization of probe 1 to coronal and saggital brain sections of 10 
week-old wild-type and HD mice. 

FIG. 5 shows the in situ hybridization corresponding to spatial and temporal expression of 
PDE10A in brain sections of wild-type and HD mice over the period of time that the HD 
mice develop abnormal movements and postures. 

FIG. 6 shows the in situ hybridization corresponding to expression of PDE10A in brain 
sections of one day old wild-type and HD mice. 

FIG. 7 shows the in situ hybridization corresponding to distribution of the mRNA of 
PDE10A in mouse striatal neurons. 

FIG. 8 is the in situ hybridization corresponding to mRNA distribution of the rat homologue 
of PDE1 OA in rat brain tissue. 

FIG. 9 shows a Southern blot analysis of DNA from wild-type and transgenic HD mice 
hybridized to the pPDElOA cDNA probe. 

FIG. 10 is a nucleotide sequence of cPDE10-l, and corresponds to SEQ ID NO. 1. 
FIG. 11 is a restriction map of cPDE10-l. 

FIG. 12 is a nucleotide sequence of cPDE10-2, and corresponds to SEQ ID NO. 2. 
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FIG. 13 is a restriction map of cPDE10-2. 

FIG. 14 is a schematic diagram showing the alignment of cPDE10-l and -2 and the regions 
that are identical and unique between the two clones. 

FIG. 15 is a nucleotide sequence of cPDElOA and RACEs. corresponding to SEQ ID NO. 1 1. 
FIG. 16 is a map of PDE10A coding sequence and restriction sites. 
FIG. 17 is a map of PDE10A coding sequence and features. 
FIG. 18 is a restriction map of PDEIOA. 

FIG. 19 is a nucleotide sequence of cPDElOA and corresponds to SEQ ID NO. 12. 

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

The following illustrative explanations are provided to facilitate understanding of certain 
terms used frequently herein The explanations are provided as a convenience and are not 
limitative of the invention. 

"Host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous polynucleotide sequence. 

"Identity", "similarity" or "homologous", as used in the art. are relationships between two or 
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more polynucleotide sequences, as determined by comparing the sequences. In the art, 
identity also means the degree of sequence relatedness between polynucleotide sequences, as 
the case may be, as determined by the match between strings of such sequences. Both 
identity and similarity can be readily calculated (Lesk, A. M., 1988; Smith, D. W., 1993; 
Griffin, A. M., and Griffin, H. G., 1994; von Heinje, G., 1987; and Gribskov, M. and 
Devereux, J., 1991). While there exist a number of methods to measure identity and 
similarity between two polynucleotide sequences, both terms are well known to skilled 
artisans (von Heinje, G., 1987; Gribskov, M. and Devereux, 1991; and Carillo, H., and 
Lipman, D., 1988). Methods commonly employed to determine identity or similarity 
between sequences include, but are not limited to those disclosed in Carillo, H., and Lipman, 
D. (1988). Methods to determine identity and similarity are codified in computer programs. 
Computer program methods to determine identity and similarity between two sequences 
include, but are not limited to, GCG program package (Devereux, J., et al., 1984), BLASTP, 
BLASTN, and FASTA (Atschul, S. F. et al., 1990). 

"Isolated" means altered "by the hand of man" from its natural state; i.e., that, if it occurs in 
nature, it has been changed or removed from its original environment, or both. For example, 
a naturally occurring polynucleotide naturally present in a living organism in its natural state 
is not "isolated," but the same polynucleotide separated from coexisting materials of its 
natural state is "isolated", as the term is employed herein. As part of or following isolation, 
such polynucleotides can be joined to other polynucleotides, such as DNA, for mutagenesis, 
to form fusion proteins, and for propagation or expression in a host, for instance. The 
isolated polynucleotides, alone or joined to other polynucleotides such as vectors, can be 
introduced into host cells, in culture or in whole organisms. Introduced into host cells in 
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culture or in whole organisms, such DNA still would be isolated, as the term is used herein, 
because they would not be in their naturally occurring form or environment. Similarly, the 
polynucleotides may occur in a composition, such as a media formulations, solutions for 
introduction of polynucleotides, for example, into cells, compositions or solutions for 
chemical or enzymatic reactions, for instance, which are not naturally occurring 
compositions, and, therein remain isolated polynucleotides within the meaning of that term as 
it is employed herein. 

"Plasmids". Starting plasmids disclosed herein are either commercially available, publicly 
available, or can be constructed from available plasmids by routine application of well 
known, published procedures. Many plasmids and other cloning and expression vectors that 
can be used in accordance with the present invention are well known and readily available to 
those of skill in the art. Moreover, those of skill readily may construct any number of other 
plasmids suitable for use in the invention. 

"Polynucleotides(s)" of the present invention may be in the form of RNA, such as rnRNA, or 
in the form of DNA, including, for instance, cDNA and genomic DNA obtained by cloning or 
produced by chemical synthetic techniques or by a combination thereof. The DNA may be 
double-stranded or single- stranded. Single-stranded polynucleotides may be the coding 
strand, also known as the sense strand, or it may be the non-coding strand, also referred to as 
the anti-sense strand. Polynucleotides generally refers to any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
Thus, for instance, polynucleotides as used herein refers to, among others, single- and double- 
stranded DNA, DNA that is a mixture of single- and double-stranded regions or single-, 
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double- and triple-stranded regions, single- and double-stranded RNA, and RNA that is 
mixture of single- and double- stranded regions, hybrid molecules comprising DNA and RNA 
that may be single-stranded or, more typically, double-stranded, or triple-stranded, or a 
mixture of single- and double- stranded regions. In addition, polynucleotide as used herein 
refers to triple- stranded regions comprising RNA or DNA or both RNA and DNA. The 
strands in such regions may be from the same molecule or from different molecules. The 
regions may include all of one or more of the molecules, but more typically involve only a 
region of some of the molecules. One of the molecules of a triple-helical region often is an 
oligonucleotide. As used herein, the term polynucleotide also includes DNA or DNA that 
contain one or more modified bases. Thus, DNA or DNA with backbones modified for 
stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, 
DNA or DNA comprising unusual bases, such as inosine, or modified bases, such as 
tritylated bases, to name just two examples, are polynucleotides as the term is used herein. It 
will be appreciated that a great variety of modifications have been made to DNA and RNA 
that serve many useful purposes known to those of skill in the art. The term polynucleotide as 
it is employed herein embraces such chemically, enzymatically or metabolically modified 
forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of 
viruses and cells, including simple and complex cells, inter alia. Polynucleotides embraces 
short polynucleotides often referred to as oligonucleotide(s). It will also be appreciated that 
RNA made by transcription of this doubled stranded nucleotide sequence, and an antisense 
strand of a nucleic acid molecule of the invention or an oligonucleotide fragment of the 
nucleic acid molecule, are contemplated within the scope of the invention. An antisense 
sequence is constructed by inverting the sequence of a nucleic acid molecule of the invention, 
relative to its normal presentation for transcription. Preferably, an antisense sequence is 
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constructed by inverting a region preceding the initiation codon or an unconsented region. 
The antisense sequences may be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. 

"Stringent hybridization conditions" are those which are stringent enough to provide 
specificity, reduce the number of mismatches and yet are sufficiently flexible to allow 
formation of stable hybrids at an acceptable rate. Such conditions are known to those skilled 
in the art and are described, for example, in Sambrook, et al, (1989). By way of example 
only, stringent hybridization with short nucleotides may be carried out at 5-10° below the T M 
using high concentrations of probe such as 0.01-1.0 pmole/ml. Preferably, the term "stringent 
conditions" means hybridization will occur only if there is at least 95% and preferably at least 
97% identity between the sequences. 

"Variant(s)" of polynucleotides are polynucleotides that differ in nucleotide sequence from 
another, reference polynucleotide. Generally, differences are limited so that the nucleotide 
sequences of the reference and the variant are closely similar overall and, in many regions, 
identical. Changes in the nucleotide sequence of the variant may be silent. That is, they may 
not alter the amino acids encoded by the polynucleotide. Where alterations are limited to 
silent changes of this type a variant will encode a polypeptide or polynucleotide with the 
same amino acid sequence as the reference. Changes in the nucleotide sequence of the 
variant may alter the amino acid sequence of a polypeptide encoded by the reference 
polynucleotide. Such nucleotide changes may result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide or polynucleotide encoded by the 
reference sequence. 



12 



As hereinbefore mentioned, the present inventors have identified and sequenced a DNA 
sequence encoding PDE10A. The DNA sequence is shown in the Sequence Listing as SEQ 
IDNO:l,NO:2 andNO:ll. 

It will be appreciated that the invention includes nucleotide or amino acid sequences which 
have substantial sequence homology with the nucleotide sequences shown in the Sequence 
Listing as SEQ ID NO: 1, NO:2 and NO: 1 1 . The term "sequences having substantial 
sequence homology" means those nucleotide and amino acid sequences which have slight or 
inconsequential sequence variations from the sequences disclosed in the Sequence Listing as 
SEQ ID NO: 1 , NO:2 and NO: 1 1 ; i.e. the homologous sequences function in substantially 
the same manner to produce substantially the same polypeptides as the actual sequences. The 
variations may be attributable to local mutations or structural modifications. It is expected 
that a sequence having 85-90% sequence homology with the DNA sequence of the invention 
will provide a functional PDE10 polypeptide. 

As used herein, "PDE10A" comprises a polynucleotide sequence which is down regulated in 
the course of CAG repeat disorders selected from the group consisting of: (a) a sequence 
comprising SEQ ID NO:l; (b) a sequence comprising SEQ ID NO:2; (c) a sequence 
comprising SEQ ID NO: 1 1 ; (d) a sequence comprising nucleotides 257 to 2596 of SEQ ID 
NO: 1 1 ; (e) a sequence which is at least 90% homologous with a sequence of (a), (b), (c) or 
(d), and; (f) a sequence which hybridizes to (a), (b), (c) or (d) under stringent conditions. In 
an embodiment, the isolated polynucleotide segment is cDNA. The invention also teaches an 
isolated polynucleotide segment, which retains substantially the same biological function or 
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activity as the polynucleotide encoded by the polynucleotide sequence. 

Further embodiments of the invention are polynucleotides that are at least 70% identical over 
their entire length to a polynucleotide encoding PDE10 polypeptide or polynucleotide, and 
polynucleotides which are complementary to such polynucleotides. Other embodiments are 
polynucleotides that comprise a region that is at least 80% identical over their entire length to 
a polynucleotide encoding PDE10 of SEQ ID NO.l 1 and polynucleotides complementary 
thereto. This includes polynucleotides at least 90% identical over their entire length to the 
same, and among these embodiments are polynucleotides with at least 95%. Furthermore, 
those with at least 97% are highly preferred among those with at least 95%, and among these 
those with at least 98% and at least 99% are particularly highly preferred, with at least 99% 
being the more preferred. 

The polynucleotides of the present invention may be employed as research reagents and 
materials for discovery of treatments of and diagnostics for disease, particularly human 
disease, as further discussed herein. 

Analysis of the complete nucleotide and amino acid sequences of the protein of the invention 
using the procedures of Sambrook et al., supra, have been used to determine the expressed 
region, initiation codon and untranslated sequences of the PDE10A gene. The transcription 
regulatory sequences of the gene are determined by analyzing fragments of the DNA for their 
ability to express a reporter gene such as the bacterial gene lacZ. 

The nucleic acid molecules of the invention allow those skilled in the art to construct 
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nucleotide probes for use in the detection of nucleotide sequences in biological materials. As 
shown in FIG. 11, 13, 15 and 16, a number of unique restriction sequences for restriction 
enzymes are incorporated in the nucleic acid molecule identified in the Sequence Listing as 
SEQ ID NO:l, NO:2 and NO:l 1, and these provide access to nucleotide sequences which 
code for polypeptides unique to the PDE10A polypeptide of the invention. Nucleotide 
sequences unique to PDE10A or iso forms thereof, can also be constructed by chemical 
synthesis and enzymatic ligation reactions carried out by procedures known in the art. 

A nucleotide probe may be labeled with a detectable marker such as a radioactive label which 
provides for an adequate signal and has sufficient half- life such as 32p, 3H, 14C or the like. 
Other detectable markers which may be used include antigens that are recognized by a 
specific labeled antibody, fluorescent compounds, enzymes, antibodies specific for a labeled 
antigen, and chemilurninescent compounds. An appropriate label may be selected having 
regard to the rate of hybridization and binding of the probe to the nucleotide to be detected 
and the amount of nucleotide available for hybridization. The nucleotide probes may be used 
to detect genes related to or analogous to PDE10A of the invention. 

Accordingly, the present invention also provides a method of detecting the presence of 
nucleic acid molecules encoding a polypeptide related to or analogous to PDE10A in a 
sample comprising contacting the sample under hybridization conditions with one or more of 
the nucleotide probes of the invention labeled with a detectable marker, and determining the 
degree of hybridization between the nucleic acid molecule in the sample and the nucleotide 
probes. 



15 



Hybridization conditions which may be used in the method of the invention are known in the 
art and are described for example in Sambrook J, et al., supra. The hybridization product 
may be assayed using techniques known in the art. The nucleotide probe may be labeled with 
a detectable marker as described herein and the hybridization product may be assayed by 
detecting the detectable marker or the detectable change produced by the detectable marker. 

The nucleic acid molecule of the invention also permits the identification and isolation, or 
synthesis of nucleotide sequences which may be used as primers to amplify a polynucleotide 
molecule of the invention, for example in polymerase chain reaction (PCR). The length and 
bases of the primers for use in the PCR are selected so that they will hybridize to different 
strands of the desired sequence and at relative positions along the sequence such that an 
extension product synthesized from one primer when it is separated from its template can 
serve as a template for extension of the other primer into a nucleic acid of defined length. 

Primers which may be used in the invention are oligonucleotides i.e. molecules containing 
two or more deoxyribonucleotides of the nucleic acid molecule of the invention which occur 
naturally as in a purified restriction endonuclease digest or are produced synthetically using 
techniques known in the art such as, for example, phosphotriester and phosphodiester 
methods (See Good et al, 1977) or automated techniques (see, for example, Conolly, B. A., 
1987). The primers are capable of acting as a point of initiation of synthesis when placed 
under conditions which permit the synthesis of a primer extension product which is 
complementary to the DNA sequence of the invention e.g. in the presence of nucleotide 
substrates, an agent for polymerization such as DNA polymerase and at suitable temperature 
and pH. Preferably, the primers are sequences that do not form secondary structures by base 
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pairing with other copies of the primer or sequences that form a hair pin configuration. The 
primer may be single or double-stranded. When the primer is double- stranded it may be 
treated to separate its strands before using it to prepare amplification products. The primer 
preferably contains between about 7 and 25 nucleotides. 

The primers may be labeled with detectable markers which allow for detection of the 
amplified products. Suitable detectable markers are radioactive markers such as P-32, S-35, 1- 
125, and H-3, luminescent markers such as chemiluminescent markers, preferably luminol, 
and fluorescent markers, preferably dansyl chloride, fluorcein-5-isothiocyanate, and 4-fluor- 
7-nitrobenz-2-axa-l,3 diazole, enzyme markers such as horseradish peroxidase, alkaline 
phosphatase, .beta.-galactosidase, acetylcholinesterase, or biotin. 

It will be appreciated that the primers may contain non-complementary sequences provided 
that a sufficient amount of the primer contains a sequence which is complementary to a 
nucleic acid molecule of the invention or oligonucleotide sequence thereof, which is to be 
amplified. Restriction site linkers may also be incorporated into the primers allowing for 
digestion of the amplified products with the appropriate restriction enzymes facilitating 
cloning and sequencing of the amplified product. 

Thus, a method of determining the presence of a nucleic acid molecule having a sequence 
encoding PDE10A or a predetermined oligonucleotide fragment thereof in a sample, is 
provided comprising treating the sample with primers which are capable of amplifying the 
nucleic acid molecule or the predetermined oligonucleotide fragment thereof in a polymerase 
chain reaction to form amplified sequences, under conditions which permit the formation of 
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amplified sequences and, assaying for amplified sequences. 

The polymerase chain reaction refers to a process for amplifying a target nucleic acid 
sequence as generally described in Innis et al, Academic Press, 1989, in Mullis el al., U.S. 
Pat. No. 4,863,195 and Mullis, U.S. Pat. No. 4,683,202 which are incorporated herein by 
reference. Conditions for amplifying a nucleic acid template are described in M. A. Innis and 
D. H. Gelfand, 1989, which is also incorporated herein by reference. 

The amplified products can be isolated and distinguished based on their respective sizes using 
techniques known in the art. For example, after amplification, the DNA sample can be 
separated on an agarose gel and visualized, after staining with ethidium bromide, under ultra 
violet (UV) light. DNA may be amplified to a desired level and a further extension reaction 
may be performed to incorporate nucleotide derivatives having detectable markers such as 
radioactive labeled or biotin labeled nucleoside triphosphates. The primers may also be 
labeled with detectable markers. The detectable markers may be analyzed by restriction and 
electrophoretic separation or other techniques known in the art. 

The conditions which may be employed in the methods of the invention using PCR are those 
which permit hybridization and amplification reactions to proceed in the presence of DNA in 
a sample and appropriate complementary hybridization primers. Conditions suitable for the 
polymerase chain reaction are generally known in the art. For example, see M. A. Innis and 
D. H. Gelfand, 1989, which is incorporated herein by reference. Preferably, the PCR utilizes 
polymerase obtained from the thermophilic bacterium Thermus aquatics (Taq polymerase, 
GeneAmp Kit, Perkin Elmer Cetus) or other thermostable polymerase may be used to amplify 
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DNA template strands. 

It will be appreciated that other techniques such as the Ligase Chain Reaction (LCR) and 
Nucleic- Acid Sequence Based Amplification (NASBA) may be used to amplify a nucleic 
acid molecule of the invention. In LCR, two primers which hybridize adjacent to each other 
on the target strand are ligated in the presence of the target strand to produce a 
complementary strand (Barney, 1991 and European Published Application No. 0320308, 
published Jun. 14, 1989). NASBA is a continuous amplification method using two primers, 
one incorporating a promoter sequence recognized by an RNA polymerase and the second 
derived from the complementary sequence of the target sequence to the first primer (U.S. Ser. 
No. 5,130,238 to Malek). 

The present invention also teaches vectors which comprise a polynucleotide or 
polynucleotides of the present invention, host cells which are genetically engineered with 
vectors of the invention and the production of polynucleotides of the invention by 
recombinant techniques. 

In accordance with this aspect of the invention the vector may be, for example, a plasmid 
vector, a single or double-stranded phage vector, a single or double-stranded RNA or DNA 
viral vector. In certain embodiments in this regard, the vectors provide for specific 
expression. Such specific expression may be inducible expression or expression only in 
certain types of cells or both inducible and cell-specific. Particular among inducible vectors 
are vectors that can be induced for expression by environmental factors that are easy to 
manipulate, such as temperature and nutrient additives. A variety of vectors suitable to this 
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aspect of the invention, including constitutive and inducible expression vectors for use in 
prokaryotic and eukaryotic hosts, are well known and employed routinely by those of skill in 
the art. Such vectors include, among others, chromosomal, episomal and virus-derived 
vectors, e.g., vectors derived from bacterial plasmids, from bacteriophage, from transposons, 
from yeast episomes, from insertion elements, from yeast chromosomal elements, from 
viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, 
fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 
combinations thereof, such as those derived from plasmid and bacteriophage genetic 
elements, such as cosmids and phagemids, all may be used for expression in accordance with 
this aspect of the present invention. 

The following vectors, which are commercially available, are provided by way of example. 
Among vectors for use in bacteria are pQE70, pQE60 and pQE-9, available from Qiagen; 
pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, 
available from Stratagene; andptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available 
from Pharmacia, and pBR322 (ATCC 37017). Among eukaryotic vectors are pWLNEO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3, pBPV, pMSG 
and pSVL available from Pharmacia. These vectors are listed solely by way of illustration of 
the many commercially available and well known vectors that are available to those of skill in 
the art for use in accordance with this aspect of the present invention. It will be appreciated 
that any other plasmid or vector suitable for, for example, introduction, maintenance, 
propagation or expression of a polynucleotide or polypeptide of the invention in a host may 
be used in this aspect of the invention. Generally, any vector suitable to maintain, propagate 
or express polynucleotides to express a polypeptide or polynucleotide in a host may be used 
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for expression in this regard. 



The appropriate DNA sequence may be inserted into the vector by any of a variety of well- 
known and routine techniques. In general, expression constructs will contain sites for 
transcription initiation and termination, and, in the transcribed region, a ribosome binding site 
for translation. The coding portion of the mature transcripts expressed by the constructs will 
include a translation initiating AUG at the beginning and a termination codon appropriately 
positioned at the end of the polynucleotide to be translated. 

The DNA sequence in the expression vector is operatively linked to appropriate expression 
control sequence(s), including, for instance, a promoter to direct mRNA transcription. 
Promoter regions can be selected from any desired gene using vectors that contain a reporter 
transcription unit lacking a promoter region, such as a chloramphenicol acetyl transferase 
("CAT") transcription unit, downstream of restriction site or sites for introducing a candidate 
promoter fragment; i.e., a fragment that may contain a promoter. As is well known, 
introduction into the vector of a promoter-containing fragment at the restriction site upstream 
of the cat gene engenders production of CAT activity, which can be detected by standard 
CAT assays. Vectors suitable to this end are well known and readily available, such as 
pKK232-8 and pCM7. Promoters for expression of polynucleotides of the present invention 
include not only well known and readily available promoters, but also promoters that readily 
may be obtained by the foregoing technique, using a reporter gene. Among known 
prokaryotic promoters suitable for expression of polynucleotides and polypeptides in 
accordance with the present invention are the E. coli lad and lacZ and promoters, the T3 and 
T7 promoters, the gpt promoter, the lambda PR, PL promoters and the trp promoter. Among 
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known eukaryotic promoters suitable in this regard are the CMV immediate early promoter, 
the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of 
retroviral LTRs, such as those of the Rous sarcoma virus ("RSV"), and metallothionein 
promoters, such as the mouse metallothionein-I promoter. 

Vectors for propagation and expression generally will include selectable markers and 
amplification regions, such as, for example, those set forth in Sambrook et al., supra. 

As hereinbefore mentioned, the present invention also teaches host cells which are genetically 
engineered with vectors of the invention. 

Polynucleotide constructs in host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. The PDE10A polynucleotide or 
polypeptide products or isoforms or parts thereof, may be obtained by expression in a suitable 
host cell using techniques known in the art. Suitable host cells include prokaryotic or 
eukaryotic organisms or cell lines, for example bacterial, mammalian, yeast, or other fungi, 
viral, plant or insect cells. Methods for transforming or transfecting cells to express foreign 
DNA are well known in the art (See for example, Itakura et al., U.S. Pat. No. 4,704,362; 
Hinnen et al, 1978; Murray et al., U.S. Pat. No. 4,801,542; Upshall et al., U.S. Pat. No. 
4,935,349; Hagen et al., U.S. Pat. No. 4,784,950; Axel et al., U.S. Pat. No. 4,399,216; 
Goeddal et al., U.S. Pat. No. 4,766,075; and Sambrook et al, 1989, all of which are 
incorporated herein by reference). Representative examples of appropriate hosts include 
bacterial cells, such as streptococci, staphylococci, E. coli, streptomyces and Bacillus subtilis 
cells; fungal cells, such as yeast cells and Aspergillus cells; insect cells such as Drosophila S2 
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and Spodoptera Sf9 cells; animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, 293 and 
Bowes melanoma cells; and plant cells. 

Host cells can be genetically engineered to incorporate polynucleotides and express 
polynucleotides of the present invention. Introduction of polynucleotides into the host cell 
can be affected by calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, 
transduction, scrape loading, ballistic introduction, infection or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al. (1986) and Sambrook 
etal. (1989). 

As hereinbefore mentioned, the present invention also teaches the production of 
polynucleotides of the invention by recombinant techniques. 

The PDE10 polynucleotides encode a polypeptide which is the mature protein plus additional 
amino or carboxyl-terminal amino acids, or amino acids interior to the mature polypeptide 
(when the mature form has more than one polypeptide chain, for instance). Such sequences 
may play a role in processing of a protein from precursor to a mature form, may allow protein 
transport, may lengthen or shorten protein half-life or may facilitate manipulation of a protein 
for assay or production, among other things. As generally is the case in vivo, the additional 
amino acids may be processed away from the mature protein by cellular enzymes. 

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
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such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

In sum, a polynucleotide of the present invention may encode a mature protein, a mature 
protein plus a leader sequence (which may be referred to as a preprotein), a precursor of a 
mature protein having one or more prosequences which are not the leader sequences of a 
preprotein, or a preproprotein, which is a precursor to a proprotein, having a leader sequence 
and one or more prosequences, which generally are removed during processing steps that 
produce active and mature forms of the polypeptide. 

The polypeptides of the invention may be prepared by culturing the host/vector systems 
described above, in order to express the recombinant polypeptides. Recombinantly produced 
PDE10A based protein or parts thereof, may be further purified using techniques known in 
the art such as commercially available protein concentration systems, by salting out the 
protein followed by dialysis, by affinity chromatography, or using anion or cation exchange 
resins. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using DNA derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook et al., supra. 

Polynucleotides of the invention, encoding the heterologous structural sequence of a 
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polynucleotide or polypeptide of the invention generally will be inserted into a vector using 
standard techniques so that it is operably linked to the promoter for expression. The 
polynucleotide will be positioned so that the transcription start site is located appropriately 5' 
to a ribosome binding site. The ribosome binding site will be 5' to the AUG that initiates 
translation of the polynucleotide or polypeptide to be expressed. Generally, there will be no 
other open reading frames that begin with an initiation codon, usually AUG, and lie between 
the ribosome binding site and the initiation codon. Also, generally, there will be a translation 
stop codon at the end of the expressed polynucleotide and there will be a polyadenylation 
signal in constructs for use in eukaryotic hosts. Transcription termination signal appropriately 
disposed at the 3' end of the transcribed region may also be included in the polynucleotide 
construct. 

For secretion of the translated protein into the lumen of the endoplasmic reticulum, into the 
periplasmic space or into the extracellular environment, appropriate secretion signals may be 
incorporated into the expressed polynucleotide or polypeptide. These signals may be 
endogenous to the polynucleotide or they may be heterologous signals. Microbial cells 
employed in expression of proteins can be disrupted by any convenient method, including 
freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, such 
methods are well know to those skilled in the art. PDE10A polynucleotide or polypeptide 
can be recovered and purified from recombinant cell cultures by well-known methods 
including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography is employed for 
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purification. Well known techniques for refolding protein may be employed to regenerate 
active conformation when the polynucleotide is denatured during isolation and or 
purification. 

In an embodiment, a nucleic acid molecule of the invention may be cloned into a glutathione 
S-transferase (GST) gene fusion system for example the pGEX-1 T, pGEX-2T and pGEX-3X 
of Pharmacia. The fused gene may contain a strong lac promoter, inducible to a high level of 
expression by IPTG, as a regulatory element. Thrombin or factor Xa cleavage sites may be 
present which allow proteolytic cleavage of the desired polypeptide from the fusion product. 
The glutathione S-transferase-PDElOA fusion protein may be easily purified using a 
glutathione sepharose 4B column, for example from Pharmacia. The 26 kd glutathione S- 
transferase polypeptide can be cleaved by thrombin (pGEX-1 or pGEX-2T) or factor Xa 
(pGEX-3X) and resolved from the using the polypeptide using the same affinity column. 
Additional chromatographic steps can be included if necessary, for example Sephadex or 
DEAE cellulose. The two enzymes may be monitored by protein and enzymatic assays and 
purity may be confirmed using SDS-PAGE. 

The PDE10A protein or parts thereof may also be prepared by chemical synthesis using 
techniques well known in the chemistry of proteins such as solid phase synthesis (Merrifield, 
1964) or synthesis in homogenous solution (Houbenweyl, 1987). 

Within the context of the present invention, PDE10A polypeptide includes various structural 
forms of the primary protein which retain biological activity. For example, PDE10A 
polypeptide may be in the form of acidic or basic salts or in neutral form. In addition, 
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individual amino acid residues may be modified by oxidation or reduction. Furthermore, 
various substitutions, deletions or additions may be made to the amino acid or nucleic acid 
sequences, the net effect being that biological activity of PDE10A is retained. Due to code 
degeneracy, for example, there may be considerable variation in nucleotide sequences 
encoding the same amino acid. 

The polypeptide may be expressed in a modified form, such as a fusion protein, and may 
include not only secretion signals but also additional heterologous functional regions. Thus, 
for instance, a region of additional amino acids, particularly charged amino acids, may be 
added to the C- or N-terminus of the polypeptide to improve stability and persistence in the 
host cell, during purification or during subsequent handling and storage. Also, fusion proteins 
may be added to the polynucleotide or polypeptide to facilitate purification. Such regions 
may be removed prior to final preparation of the polynucleotide or polypeptide. The addition 
of peptide moieties to polynucleotide or polypeptides to engender secretion or excretion, to 
improve stability or to facilitate purification, among others, are familiar and routine 
techniques in the art. In drug discovery, for example, proteins have been fused with antibody 
Fc portions for the purpose of high-throughput screening assays to identify antagonists (see 
Bennett et al., 1995, and Johanson et al.,1995). 

Detecting Presence of or Predisposition for CAG Repeat Disorders 

This invention is also related to the use of the PDE10A polynucleotides to detect 
complementary polynucleotides as a diagnostic reagent. Detection of the level of expression 
of PDE10A in a eukaryote, particularly a mammal, and especially a human, will provide a 
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method for diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly 
mammals, and especially humans, exhibiting decreased levels of PDE10A may be detected 
by a variety of techniques. Nucleic acids for diagnosis may be obtained from an infected 
individual's cells and tissues, such as the striatum, nucleus accumbens and olfactory 
tubercule. RNA may be used directly for detection or may be amplified enzymatically by 
using PCR (Saiki et al., 1986) prior to analysis. As an example, PCR primers complementary 
to the nucleic acid encoding PDE10A can be used to identify and analyze PDE10A presence 
and/or expression. Using PCR, characterization of the level of PDE10A present in the 
individual may be made by comparative analysis. 

The invention thus provides a process for detecting disease by using methods known in the 
art and methods described herein to detect decreased expression of PDE10 polynucleotide. 
For example, decreased expression of PDE10 polynucleotide can be measured using any on 
of the methods well known in the art for the quantification of polynucleotides, such as, for 
example, PCR, RT-PCR, DNAse protection, northern blotting and other hybridization 
methods. Thus, the present invention provides a method for detecting triplet-repeat disorders, 
and a method for detecting a genetic pre-disposition for triplet-repeat disorders and other 
disorders of the basal ganglia including schizophrenia, stroke, trauma, Parkinson's disease 
and Alzheimer's disease (AD). More generally, the present invention provides a method for 
detecting a genetic pre-disposition for neurological disorders characterized by progressive 
cell loss. 
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Drug Screening Assays 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of PDE10 polypeptides or polynucleotides, 
such as its interaction with PDElO-binding molecules. The identification of mutations in 
specific genes in inherited neurodegenerative disorders, combined with advances in the field 
of transgenic methods, provides those of skill in the art with the information necessary to 
further study human diseases. This is extraordinarily useful in modeling familial forms of 
triplet-repeat disorders and other disorders of the basal ganglia including schizophrenia, 
stroke, trauma, Parkinson's disease and Alzheimer's disease (AD). More generally, the 
present invention is useful for modeling neurological disorders characterized by progressive 
cell loss, as well as those involving acute cell loss, such as stroke and trauma. 

For example, to screen for agonists or antagonists, a synthetic reaction mix, a cellular 
compartment, such as a membrane, cell envelope or cell wall, or a preparation of any thereof, 
may be prepared from a cell that expresses a molecule that binds PDE10. The preparation is 
incubated with labeled PDE10 in the absence or the presence of a candidate molecule which 
may be a PDE10 agonist or antagonist. The ability of the candidate molecule to bind the 
binding molecule is reflected in decreased binding of the labeled ligand. 

PDElO-like effects of potential agonists and antagonists may by measured, for instance, by 
determining activity of a reporter system following interaction of the candidate molecule with 
a cell or appropriate cell preparation, and comparing the effect with that of PDE10 or 
molecules that elicit the same effects as PDE10. Reporter systems that may be useful in this 
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regard include, but are not limited to, colorimetric labeled substrate converted into product, a 
reporter gene that is responsive to changes in PDE10 activity, and binding assays known in 
the art. 

Another example of an assay for PDE10 antagonists is a competitive assay that combines 
PDE10 and a potential antagonist with membrane-bound PDElO-binding molecules, 
recombinant PDE10 binding molecules, natural substrates or ligands, or substrate or ligand 
mimetics, under appropriate conditions for a competitive inhibition assay. PDE10 can be 
labeled, such as by radioactivity or a colorimetric compound, such that the number of PDE10 
molecules bound to a binding molecule or converted to product can be determined accurately 
to assess the effectiveness of the potential antagonist. 

Potential antagonists include small organic molecules, peptides, polypeptides and antibodies 
that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or extinguish 
its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a 
binding molecule, such as a binding molecule, without inducing PDElO-induced activities, 
thereby preventing the action of PDE10 by excluding PDE10 from binding. 

Potential antagonists include a small molecule which binds to and occupies the binding site 
of the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, 1988, for a description of these molecules). 
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Potential antagonists include compounds related to and derivatives of PDE10. 

Developing modulators of the biological activities of specific PDEs requires differentiating 
PDE isozymes present in a particular assay preparation. The classical enzymological 
approach of isolating PDEs from natural tissue sources and studying each new isozyme may 
be used. Another approach has been to identify assay conditions which might favor the 
contribution of one isozyme and minimize the contribution of others in a preparation. Still 
another approach has been the separation of PDEs by immunological means. Each of the 
foregoing approaches for differentiating PDE isozymes is time consuming. As a result many 
attempts to develop selective PDE modulators have been performed with preparations 
containing more than one isozyme. Moreover, PDE preparations from natural tissue sources 
are susceptible to limited proteolysis and may contain mixtures of active proteolytic products 
that have different kinetic, regulatory and physiological properties than the full length PDEs. 

Recombinant PDE 10 polypeptide products of the invention greatly facilitate the development 
of new and specific PDE10 modulators. The need for purification of an isozyme can be 
avoided by expressing it recombinantly in a host cell that lacks endogenous 
phosphodiesterase activity (e.g., yeast strain YKS45 deposited as ATCC 74225). Once a 
compound that modulates the activity of the PDE10 is discovered, its selectivity can be 
evaluated by comparing its activity on the PDE 10 to its activity on other PDE isozymes. 
Thus, the combination of the recombinant PDE 10 products of the invention with other 
recombinant PDE products in a series of independent assays provides a system for developing 
selective modulators of PDE10. Selective modulators may include, for example, antibodies 
and other proteins or peptides which specifically bind to the PDE 10 or PDE 10 nucleic acid, 
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oligonucleotides which specifically bind to the PDE10 (see Patent Cooperation Treaty 
International Publication No. WO93/05182 published Mar. 18, 1993 which describes 
methods for selecting oligonucleotides which selectively bind to target biomolecules) or 
PDE10 nucleic acid (e.g., antisense oligonucleotides) and other non-peptide natural or 
synthetic compounds which specifically bind to the PDE10 or PDE10 nucleic acid. Mutant 
forms of the PDE10 which alter the enzymatic activity of the PDE10 or its localization in a 
cell are also contemplated. Crystallization of recombinant PDE10 alone and bound to a 
modulator, analysis of atomic structure by X-ray crystallography, and computer modelling of 
those structures are methods useful for designing and optimizing non-peptide selective 
modulators. See, for example, Erickson et al., Ann. Rep. Med. Chem., 27: 271-289 (1992) for 
a general review of structure-based drug design. 

Targets for the development of selective modulators include, for example: (1) the regions of 
the PDE10 which contact other proteins and/or localize the PDE10 within a cell, (2) the 
regions of the PDE10 which bind substrate, (3) the allosteric cGMP-binding site(s) of 
PDE10, (4) the metal-binding regions of the PDE10, (5) the phosphorylation site(s) of PDE10 
and (6) the regions of the PDE10 which are involved in dimerization of PDE10 subunits. 

Thus, the present invention provides a method for screening and selecting compounds which 
promote triplet-repeat disorders, and a method for screening and selecting compounds which 
treat or inhibit triplet-repeat disorders, as well as schizophrenia, stroke, trauma, Parkinson's 
disease and Alzheimer's disease. More generally, the present invention provides a method for 
screening and selecting compounds which promote or inhibit neurological disorders 
characterized by progressive cell loss, as well as those involving acute cell loss, such as 
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stroke and trauma. 

The selected antagonists and agonists may be administered, for instance, to inhibit 
progressive and acute neurological disorders, such as Huntington's disease, Parkinson's 
disease, schizophrenia, Alzheimer's disease (AD), stroke or trauma. 

Antagonists and agonists and other compounds of the present invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. The 
pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by direct microinjection into the affected area, or by 
intravenous or other routes. These compositions of the present invention may be employed in 
combination with a non-sterile or sterile carrier or carriers for use with cells, tissues or 
organisms, such as a pharmaceutical carrier suitable for administration to a subject. Such 
compositions comprise, for instance, a media additive or a therapeutically effective amount of 
antagonists or agonists of the invention and a pharmaceutically acceptable carrier or 
excipient. Such carriers may include, but are not limited to, saline, buffered saline, dextrose, 
water, glycerol, ethanol and combinations thereof. The formulation is prepared to suit the 
mode of administration. 

Inhibition of PDE10A will be highly detrimental to striatal brain function. The progressive 
decline in PDE10A mRNA levels in HD may lead to dysregulation of c AMP levels and 
neuronal dysfunction. Up-regulation of PDE10A will be effective in combating such 
neuronal dysfunction. 
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Gene Therapy 



A variety of gene therapy approaches may be used in accordance with the invention to 
modulate expression of the PDE10A gene in vivo. For example, antisense DNA molecules 
may be engineered and used to block translation of PDE10A mRNA in vivo. Alternatively, 
ribozyme molecules may be designed to cleave and destroy the PDE10A rnRNAs in vivo. In 
another alternative, oligonucleotides designed to hybridize to the 5' region of the PDE10A 
gene (including the region upstream of the coding sequence) and form triple helix structures 
may be used to block or reduce transcription of the PDE10A gene. In yet another alternative, 
nucleic acid encoding the full length wild-type PDE10A message may be introduced in vivo 
into cells which otherwise would be unable to produce the wild-type PDE10A gene product 
in sufficient quantities or at all. 

In a preferred embodiment, the antisense, ribozyme and triple helix nucleotides are designed 
to inhibit the translation or transcription of PDE10A. To accomplish this, the 
oligonucleotides used should be designed on the basis of relevant sequences unique to 
PDE10A. 

For example, and not by way of limitation, the oligonucleotides should not fall within those 
region where the nucleotide sequence of PDE10A is most homologous to that of other PDEs, 
such as PDE2 PDE5 and PDE6, herein referred to as "unique regions". 

In the case of antisense molecules, it is preferred that the sequence be chosen from the unique 
regions. It is also preferred that the sequence be at least 18 nucleotides in length in order to 
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achieve sufficiently strong annealing to the target mRNA sequence to prevent translation of 
the sequence. Izant and Weintraub, 1984, Cell, 36:1007-1015; Rosenberg et al., 1985, Nature, 
313:703-706. 

In the case of the "hammerhead" type of ribozymes, it is also preferred that the target 
sequences of the ribozymes be chosen from the unique regions. Ribozymes are RNA 
molecules which possess highly specific endoribonuclease activity. Hammerhead ribozymes 
comprise a hybridizing region which is complementary in nucleotide sequence to at least part 
of the target RNA, and a catalytic region which is adapted to cleave the target RNA. The 
hybridizing region contains nine (9) or more nucleotides. Therefore, the hammerhead 
ribozymes of the present invention have a hybridizing region which is complementary to the 
sequences listed above and is at least nine nucleotides in length. The construction and 
production of such ribozymes is well known in the art and is described more fully in Haseloff 
and Gerlach, 1988, Nature, 334:585-591. 

The ribozymes of the present invention also include RNA endoribonucleases (hereinafter 
"Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena Thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been extensively described by Thomas 
Cech and collaborators (Zaug, et al., 1984, Science, 224:574-578; Zaug and Cech, 1986, 
Science, 231:470-475; Zaug, et al, 1986, Nature, 324:429-433; published International patent 
application No. WO 88/04300 by University Patents Inc.; Been and Cech, 1986, Cell, 47:207- 
216). The Cech endoribonucleases have an eight base pair active site which hybridizes to a 
target RNA sequence whereafter cleavage of the target RNA takes place. The invention 
encompasses those Cech-type ribozymes which target eight base-pair active site sequences 
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that are present in PDE10A but not other PDEs. 

The foregoing compounds can be administered by a variety of methods which are known in 
the art including, but not limited to the use of liposomes as a delivery vehicle. Naked DNA or 
RNA molecules may also be used where they are in a form which is resistant to degradation 
such as by modification of the ends, by the formation of circular molecules, or by the use of 
alternate bonds including phosphothionate and thiophosphoryl modified bonds. In addition, 
the delivery of nucleic acid may be by facilitated transport where the nucleic acid molecules 
are conjugated to poly-lysine or transferrin. Nucleic acid may also be transported into cells by 
any of the various viral carriers, including but not limited to, retrovirus, vaccinia, AAV, and 
adenovirus. 

Alternatively, a recombinant nucleic acid molecule which encodes, or is, such antisense, 
ribozyme, triple helix, or PDE10A molecule can be constructed. This nucleic acid molecule 
may be either RNA or DNA. If the nucleic acid encodes an RNA, it is preferred that the 
sequence be operatively attached to a regulatory element so that sufficient copies of the 
desired RNA product are produced. The regulatory element may permit either constitutive or 
regulated transcription of the sequence. In vivo, that is, within the cells or cells of an 
organism, a transfer vector such as a bacterial plasmid or viral RNA or DNA, encoding one or 
more of the RNAs, may be transfected into cells e.g. (Llewellyn et al., 1987, J. Mol. Biol, 
195:115-123; Hanahan et al. 1983, J. Mol. Biol, 166:557-580). Once inside the cell, the 
transfer vector may replicate, and be transcribed by cellular polymerases to produce the RNA 
or it may be integrated into the genome of the host cell. Alternatively, a transfer vector 
containing sequences encoding one or more of the RNAs may be transfected into cells or 
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introduced into cells by way of micromanipulation techniques such as microinjection, such 
that the transfer vector or a part thereof becomes integrated into the genome of the host cell. 

Composition, Formulation, and Administration of Pharmaceutical Compositions 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is itself known, e.g., by means of conventional mixing, dissolving, granulating, dragee- 
making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present invention thus may be 
formulated in conventional manner using one or more physiologically acceptable carriers 
comprising excipients and auxiliaries which facilitate processing of the active compounds 
into preparations which can be used pharmaceutically. Proper formulation is dependent upon 
the route of administration chosen. 

For injection, the agents of the invention may be formulated in aqueous solutions, preferably 
in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer. For transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. 

For oral administration, the compounds can be formulated readily by combining the active 
compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
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liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained solid excipient, optionally 
grinding a resulting mixture, and processing the mixture of granules, after adding suitable 
auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, 
fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations 
such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 

carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar 
solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, 
carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or 
dragee coatings for identification or to characterize different combinations of active 
compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such 
as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended 
in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In 
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addition, stabilizers may be added. All formulations for oral administration should be in 
dosages suitable for such administration. 

For buccal administration, the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention 
are conveniently delivered in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 
trichlorofiuoromethane, dichlorotetrafluoro ethane, carbon dioxide or other suitable gas. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in an inhaler or 
insufflator may be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit dosage 
form, e.g., in ampoules or in multidose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of the 
active compounds in water-soluble form. Additionally, suspensions of the active compounds 
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may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl 
oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, 
sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents 
which increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. 

Alternatively, the active ingredient may be in powder form for constitution with a suitable 
vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. 

In addition to the formulations described previously, the compounds may also be formulated 
as a depot preparation. Such long acting formulations may be administered by implantation 
(for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for 
example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a cosolvent 
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, 
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and an aqueous phase. Naturally, the proportions of a co-solvent system may be varied 
considerably without destroying its solubility and toxicity characteristics. Furthermore, the 
identity of the co-solvent components may be varied. 

Alternatively, other delivery systems for hydrophobic pharmaceutical compounds may be 
employed. Liposomes and emulsions are well known examples of delivery vehicles or 
carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also may 
be employed, although usually at the cost of greater toxicity. Additionally, the compounds 
may be delivered using a sustained-release system, such as semipermeable matrices of solid 
hydrophobic polymers containing the therapeutic agent. Various of sustained-release 
materials have been established and are well known by those skilled in the art. Sustained- 
release capsules may, depending on their chemical nature, release the compounds for a few 
weeks up to over 100 days. Depending on the chemical nature and the biological stability of 
the therapeutic reagent, additional strategies for protein stabilization may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or 
excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. 

Many of the compounds of the invention may be provided as salts with pharmaceutically 
compatible counterions. Pharmaceutically compatible salts may be formed with many acids, 
including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc. 
Salts tend to be more soluble in aqueous or other protonic solvents that are the corresponding 
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free base forms. 



Suitable routes of administration may, for example, include oral, rectal, transmucosal, 
transdermal, or intestinal administration; parenteral delivery, including intramuscular, 
subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, 
intravenous, intraperitoneal, intranasal, or intraocular injections. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into an affected area, often in a depot or 
sustained release formulation. 

Furthermore, one may administer the drug in a targeted drug delivery system, for example, in 
a liposome coated with an antibody specific for affected cells. The liposomes will be targeted 
to and taken up selectively by the cells. 

The pharmaceutical compositions generally are administered in an amount effective for 
treatment or prophylaxis of a specific indication or indications. It is appreciated that 
optimum dosage will be determined by standard methods for each treatment modality and 
indication, taking into account the indication, its severity, route of administration, 
complicating conditions and the like. In therapy or as a prophylactic, the active agent may be 
administered to an individual as an injectable composition, for example as a sterile aqueous 
dispersion, preferably isotonic. A therapeutically effective dose further refers to that amount 
of the compound sufficient to result in amelioration of symptoms associated with such 
disorders. Techniques for formulation and administration of the compounds of the instant 
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application may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., 
Easton, Pa., latest edition. For administration to mammals, and particularly humans, it is 
expected that the daily dosage level of the active agent will be from 0.001 mg/kg to 10 
mg/kg, typically around 0.01 mg/kg. The physician in any event will determine the actual 
dosage which will be most suitable for an individual and will vary with the age, weight and 
response of the particular individual. The above dosages are exemplary of the average case. 
There can, of course, be individual instances where higher or lower dosage ranges are 
merited, and such are within the scope of this invention. 

The invention further provides diagnostic and pharmaceutical packs and kits comprising one 
or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. Associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, reflecting approval by the agency of the manufacture, 
use or sale of the product for human administration. 

EXAMPLES 

The present invention is further described by the following examples. These examples, 
while illustrating certain specific aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 
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EXAMPLE 1 - Isolation of PDE10A 

Wild-type (B6CBAF1) and HD transgenic [B6CBA-TgN(Hdexonl)62Gpb] mice (Jackson 
Laboratories) and adult Sprague-Dawley rats (250-300 g; Charles River Laboratories) and 
were used in this study. The genotype of the mice was determined by PCR amplification of a 
100 bp region of the integrated human HD exon 1 transgene using primers corresponding to 
nts 3340-3459 (5'-AGG GCT GTC AAT CAT GCT GG-3') andnts 3836-3855 (5 '-AAA 
CTC ACG GTC GGT GCA GC-3') of clone E4.1 of the human HD gene (Accession number 
L34020). PCR conditions used are described in Mangiarini et al.(1996). DNA was extracted 
from a tail clip and an ear punch from each mouse used in this study. Both samples were 
subjected to PCR genotype analysis. For in situ hybridization analysis, the animals were 
anesthetized with >100 mg/kg sodium pentobarbital, decapitated, the brains removed and 
stored at -70°C prior to sectioning. For RNA isolation, animals were anesthetized, 
decapitated and the striatum and cortex were excised and stored in liquid nitrogen prior to 
RNA extraction. Animal care was given according to protocols approved by Dalhousie 
University and the Canadian Council of Animal Care. 

Differential display was used to identify novel mDNA or previously described mDNA whose 
relative expression levels are altered as a result of the presence of the transgene. Using 
differential display, the mRNA populations derived from the striatum of 10 week old wild 
type were compared with age-matched R6/2 transgenic mice. Differential display has been 
used extensively (> 750 references) since its development (Liang and Pardee, 1992) to 
identify changes in gene expression in cells and in tissues including brain (Douglass et al., 
1995; Babity et al., 1997a; Livesey et al., 1997; Berke et al., 1998). Perhaps the most 
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important finding was the demonstration by Qu et al., (1996) that differential display can be 
used to isolate genes differentially expressed in inbred strains of mice. The power of 
differential display is that the sequence information obtained can be directly related to the 
experimental paradigm. Moreover, such sequence information includes sufficient 
information to identify transcripts and can then lead to experiments that reveal function of the 
cognate protein in the experimental model. 

DNA sequence information of potentially differentially expressed cDNA can be used to 
generate oligonucleotide probes for in situ hybridization to define the anatomical and 
temporal patterns of expression of specific transcripts (see Babity et al., 1997a). This 
technique is especially useful to study changes in steady-state levels of mRNA in 
heterogeneous tissue such as brain. Brain tissue can be micro-dissected (Babity et al., 
1997b). This enabled the present inventors to reduce the requirement for tissue, and hence 
compare the mRNA populations derived from individual animals for each experimental 
group. 

Thus RT-PCR (Denovan- Wright et al., 1999) was used to identify differences in the patterns 
of gene expression between the striatum of wild-type and transgenic mice that were 
hemizygous for the 5' UTR, exon 1 and part of intron 1 of the human Huntingon's Disease 
gene. Total cellular RNA was isolated from the striatum and cortex of three 10 week-old 
wild-type and three 10 week-old R6/2 HD mice (Mangiarini et al., 1996) and used as the 
template to generate single-stranded cDNA. Total cellular RNA from each animal and tissue 
was purified using Trizol™ reagent (Gibco BRL) and the manufacture's protocol. 10 (ig 
aliquots of total RNA were treated with RQ1 DNAse-free DNAse (Promega) in the presence 
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of DNAsin™ (Promega) DNAse inhibitor to remove trace genomic DNA and then converted 
to single-stranded cDNA. The primers and conditions for PCR amplification follow those of 
the Delta™ RNA fingerprinting manual (Clontech). 

The cDNA was then used as the substrate for PCR reactions using 57 differential display 
primer combinations. The radio-labelled PCR products were fractionated on a denaturing 
acrylamide sequencing gels using a Genomyx LR™ sequencing apparatus, transferred to 
3MM filter paper and dried. The dried acrylamide gels were exposed to autoradiography film 
(BioMax MR™) overnight. After fractionating the radio-labelled PCR products on denaturing 
acrylamide gels, it was found that the overwhelming majority of the approximately 18,000 
PCR products screened were common to both the wild-type and HD mice (data not shown). 
One PCR product, amplified using the primers P7 (5'-ATT AAC CCT CAC TAA ATG CTG 
TAT G- 3') and T6 (5'- CAT TAT GCT GAG TGA TAT CTT TTT TTT TCG- 3') of 
approximately 500 bp, was observed in each of three samples derived from the striatum of 
wild-type mice (FIG. 1). This 500 bp band was absent from the samples derived from the 
striatum of the HD mice (FIG. 1) and was absent from each of the samples derived from the 
cortical tissue (data not shown). 

FIG. 1 shows the Down-regulated in Huntington's Disease (PDE10A) transcript, identified 
by differential display RT PCR. A band of approximately 500 bp (arrow) was amplified from 
cDNA made form 10 week-old wild-type but not 10 week-old HD striatal tissue. Total RNA 
from individual animals (numbered 1-6) was used as the substrate for the generation of 
single-stranded cDNA. Animals 1, 2 and 3 were transgenic HD mice. Animals 4, 5 and 6 
were wild-type mice. 
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EXAMPLE 2 - Cloning of PDE10A 

The 500 bp band, designate PDElOApcr, was excised from the dried gel and rehydrated in 40 

of H 2 0 for 10 min at room temperature. The eluted DNA was subjected to PCR re- 
amplification using the P7 and T6 primers, rTaq polymerase (Pharmacia) and the following 
conditions: 60" @ 94°C, 19 x (30" @ 94°C, 30" @ 58°C, 120" @ 68°C + 4" per cycle), T @ 
68°C. The PCR reaction was subjected to agarose gel electrophoresis and the 500 bp band 
was removed from the gel, extracted from the agarose using the Qiagen gel extraction 
protocol and cloned into the vector, pGem-T using standard methods. Plasmid DNA was 
isolated from selected transformants using Qiagen spin columns. The resultant clone was 
named pPDElOA. 

EXAMPLE 3 - Identification of PDE10A 

The cloned insert of pPDElOA was radio-labelled and used as a hybridization probe in 
northern blot analysis (FIG. 2). Northern blots of total RNA were prepared using the method 
described in Denovan- Wright et al. (1998). The 500 bp cloned insert of PDE10A was radio- 
labelled with [a-32P]dCTP (3000 Ci/mmol) using the Ready-to-Go dCTP beads (Pharmacia). 
Northern blot hybridization, brain tissue preparation and in situ hybridization are described in 
Denovan- Wright et al. (1998). The 500 bp cloned insert of pPDElOA annealed to a transcript 
of approximately 9.5 kb in total RNA isolated from the striatum often week-old wild-type 
mice. 
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FIG. 2 demonstrates that PDE10A is expressed in the striatum but not the cortex of wild-type 
mice and the steady-state levels of PDE10A are reduced in 10 week old transgenic HD mice. 
The differential expression of PDE10A in HD mice was confirmed by northern blot analysis. 
The cloned insert of pPDElOA was radio-labelled and used as a hybridization probe in 
northern blot analysis. The northern blot was prepared by size-fractionating total RNA from 
the striatum and cortex of three individual 10 week-old HD (1, 2 and 3) and wild-type (4, 5 
and 6) mice. Following the hybridization of pPDElOA, the radio-label was removed and the 
blot was subsequently allowed to hybridize with a probe that detects constituitively expressed 
cyclophilin. The hybridization pattern of the cyclophilin probe is aligned below the northern 
blot demonstrating that equivalent amount of RNA were present in each lane. The relative 
mobility of RNA molecular weight standards (RNA ladder, Gibco BRL) are shown on the 
left of the northern blot. 

The hybridization signal of pPDElOA was significantly lower in the RNA samples derived 
from the striatum of 10 week-old HD mice. No expression of the PDE10A mRNA was 
detected in the cortical RNA samples derived from either the wild-type or HD mice. 

EXAMPLE 4 - Sequencing PDE10A 

The sequence of the cloned differential display band, pPDElOA, was determined using M13 
universal forward and reverse sequencing primers and the T7 sequencing kit (Pharmacia). 
The 484 bp cDNA fragment did not have sequence similarity to any Genbank entries. 

FIG. 3 shows the nucleotide sequence of the cloned PDE10A differential display product, 
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pPDElOA. The position of the primers used to amplify the fragment are underlined and 
labelled. The nucleotide sequence and position of oligonucleotide probes 1 and 2 within the 
pPDElOA sequence are shown. 

EXAMPLE 5 - Isolation and Characterization of cDNA PDE10A 

In order to isolate PDE10A cDNA clones, oligonucleotide probes 1 and 2 were used in 5' and 
3' Rapid Amplification of cDNA Ends (RACE) reactions using commercially prepared 
RACE-ready mouse striatal cDNA (Clontech). Several independent clones were isolated and 
those that contained the sequence of pPDElOA were selected for further analysis. Each of the 
5' RACE clones was identical in sequence over the length that the clones could be aligned. 
The difference in length between these clones is a result of termination of the original 
reverse-transcriptase reaction at different positions along the mRNA. No difference in size or 
sequence was detected between several 3' RACE clones. The longest 5' RACE clone and 
one 3' RACE clone were completely sequenced using internal primers. The present inventors 
were able to isolate a very short clone that extended the 5 ' RACE clone using an internal 
primer (probe 3, 5'- CTA TTT CAC AAG AGA CTG ACC AGC CAA TAA ATC TC- 3'). 
The compiled sequence of the first PDE10A cDNA clone, named cPDE10A-l is presented in 
FIG. 10. cPDElOA-1 is 3235 bp in length. The restriction map of cPDE10A-l is shown in 
FIG. 11. 

The mRNA that hybridized with pPDElOA was approximately 9.5 kilobases in length. In 
order to obtain PDE10A cDNA clone that was larger than cPDE10-l, the present inventors 
screened a mouse brain cDNA library. Several clones were identified that hybridized with 
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the pPDEl 0 probe. The sequence of the largest of these cDNA clones, cPDElO-2, was. 
determined. The sequence (FIG. 12) was 5753 base pairs in length. The restriction map of 
cPDElO-2 is shown in. FIG-. 13. 

cPDE10-l and cPDE10-2 share sequence identity over 2095 bp. However, ihc 5' 1 142 bp of 
cPDElO-l and the 5' 1689 bp of cPDE10-2 arc unique to each done. Clone cPDElO-2 
extends 1969 bp in the 3' direction compared to cPDE10-l. A schematic showing the regions 
of sequence identity and the unique sequences of cPDE10-l and -2 are shown m FIG. 14 

The compiled sequence of the mouse PDE10 cDNA clone, named cPDEl OA, is presented in 
FIG. 1 5 with RACEs. A further sequence, without RACEs, is shown in FIG. 1 9. The coding 
sequence and restriction map of cPDEl OA is shown in FIG. 16. and updated at FIG. 17. FIG. 
1 S is a restriction map of PDE10A. The coding region has a met initiator commencing at 
nucleotide 257, wnh a slop codon ending at nucleotide 2596. 

PDE10A was found to have extremely high homology with human PDElOs identified by 
Loughney et al. : W099/42596, the contents of which are incorporated herein by reference. 

EXAMPLE 6 - Localization of PDE10A m the Brain 

In order to identify the coding strand and to localize the transcript m the wild-type mouse 
brain, two oligonucleotide probes were designed (probe 1,5'- GAA CAT GTA GCA TAT 
ACT CCA GAC AAC AGA TCA TAT GG-3': probe 2, 5' - CAG CTT CTC CAC AGG 
AAC ACA GTA ACA AAG AG -3') that were complementary to different regions and 
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strands of the 484 bp pPDElOA clone. These oligonucleotides were used for in situ 
hybridization analysis. Using high stringency post in situ hybridization washes (2 x 30' in IX 
SSC @ 58°C, 4 x 15' in IX SSC @ 58°C, 4 x 15' in 0.5X SSC @ 58°C, 4 x 15' in 0.25X SSC 
@ 58°C), it was found that oligonucleotide probe 1 annealed with mRNA in the striatum, 
nucleus accumbens and olfactory tubercule often week-old wild-type mice (FIG. 4). The 
hybridization signal was significantly reduced in the striatum, nucleus accumbens and 
olfactory tubercle of the 10 week-old HD mice (FIG. 4). 

FIG. 5 shows in situ hybridization of probe 1 to coronal (top three sections) and saggital 
(bottom section) 10 week-old wild-type (WT) and HD mouse brain sections. Specific 
hybridization of the probe was observed in the striatum, nucleus accumbens and olfactory 
tubercle of wild-type mice. The top three sections represent the distribution of PDE10A 
throughout the rostral-caudal axis of the striatum. 

The in situ hybridization results confirmed the northern blot analysis demonstrating, 1) that 
the expression of PDE10A mRNA was restricted to the striatum, nucleus accumbens and 
olfactory tubercle and 2) that the levels of PDE10A mRNA were decreased in HD mice 
compared to the wild-type. The probe did not anneal with mRNA in any other brain nuclei. 
No hybridization of oligonucleotide probe 2 was observed in any region of the brain in wild- 
type or HD mice (Fig. 3). Based on this hybridization, the coding strand, complementary to 
probe 1, of pPDElOA was defined. 
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EXAMPLE 7 - Characterization of PDE10 

The in situ hybridization using oligonucleotide probe 1 demonstrated that PDE10A mRNA 
levels in the striatum , nucleus accumbens and olfactory tubercule were decreased in ten 
week- old HD mice. By ten weeks of age, the HD mice all showed motor symptoms 
including resting tremor and stereotypic involuntary movements. Moreover, these mice 
immediately clasped their feet together and curled into a tight ball when picked up by their 
tails. 

As the phenotypic signs are progressive over a number of weeks, the present inventors 
examined whether the PDE10A transcript was ever expressed in the striatum of the HD mice 
or whether the steady-state levels of the transcript diminished in the striatum in a course that 
parallelled the development of the motor disorders. Wild-type and HD mice were sacrificed 
at 5, 7 and 8 weeks of age and their brains were prepared for in situ hybridization analysis 
using probe 1 (FIG. 5). 

FIG. 5 shows the levels of PDE10A mRNA decrease in HD mice over the period of time that 
the HD mice develop abnormal movements and postures. In situ hybridization analysis of 
coronal and saggital sections of wild-type and HD mouse brain using oligonucleotide probe 1 
which is complementary to the coding strand of PDE10A. At 5 weeks of age, before the 
development of motor symptoms, the HD mice express the PDE10A transcript in the same 
brain nuclei and at the same relative levels as wild-type mice. The steady-state level 
PDE10A decreases in the striatum, nucleus accumbens and olfactory tubercle from 5 to 10 
weeks in the HD but not wild-type mice. By 9 weeks of age, the HD mice have abnormal 
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movement and posture. The numbers refer to the age in weeks of the wild-type (WT) and 
Huntington's (HD) transgenic mice. 

None of the mice at these ages had overt motor symptoms. Sections taken throughout the 
rostral-caudal axis of the striatum showed that PDE10A was expressed in the 5 week-old 
wild-type and HD mice. The relative hybridization of probe 1 did not change in 5, 7, 8 and 
10 week-old wild-type mice. The intensity of the hybridization signal appeared to decrease in 
the striatum, nucleus accumbens and olfactory tubercle of HD mice from 5 to 10 weeks 
compared to their wild-type litter mates (FIG. 5). 

The levels of PDE10A were significantly reduced by 8 weeks of age in the HD mice, using 
two in situ oligonucleotide probes, one complementary to the 3' UTR, the second 
complementary to an internal portion of the coding region. The hybridization pattern 
observed in the wild-type and HD mice was the same for both the probes employed. This 
analysis demonstrated that there is a reduction in the complete PDE10A mRNA levels during 
the development of the HD phenotype and not that there was a differential reduction in the 
PDE10A coding region as compared to the extensive 3' UTR. Moreover, in situ hybridization 
using the PDE1 OA-specific probe against neurologically normal and HD human brain tissue 
demonstrated that there was a decrease in PDE10A levels in human HD patients. 

One day old wild-type and HD mice were frozen, sectioned on a cryostat and whole mouse 
sections were prepared for in situ hybridization using probe 1 . The same high stringency 
post-hybridization washing conditions were employed for the one day-old mouse body 
sections as were used for the adult mouse brain sections. Parallel in situ hyridization 
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experiments using the probe 2 were performed in order to determine the level of non-specific 
signal in the mouse sections. Probe 1 specifically annealed to the developing striatum (FIG. 
6). 



FIG. 6 demonstrates that PDE10A is expressed in the developing striatum of one day-old 
wild-type and HD mice. The sections on the left were subjected to in situ hybridization using 
probe 1 . Following hybridization, the sections were counter-stained with cresyl violet to 
visualize the mouse organs. The signal outside the brain was non-specific as probe 2 and 
other unrelated control oligonucleotide probes all labelled these tissues. 

There was no difference in the pattern of hybridization between the one day-old wild-type 
and HD mice demonstrating that PDE10A was expressed in the developing brain of both 
wild-type and HD mice. 

Following in situ hybridization, the sections were covered in autoradiographic emulsion, left 
in the dark to expose for 4 weeks and then developed and viewed under dark-field 
microscopy or, after counter-staining the sections with cresyl violet to visualize neuronal cell 
bodies, under bright- field microscopy. Silver grains were observed to be concentrated in the 
striatum of the wild-type mice. FIG. 7 shows emulsion autoradiography of mouse brain 
sections following in situ hybridization of probe 1 demonstrated that the PDE10A transcript 
is expressed in neurons. PDE10A is not homogeneously distributed throughout the mouse 
striatum. Dark field illumination of the sections after emulsion autoradiography showed that 
the silver grains were clustered in specific regions of the 10 week old wild-type mouse 
striatum (A and C). Sections from 10 week old HD mice subjected to identical in situ and 
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emulsion autoradiographic conditions are shown in B and D. The photomicrographs shown 
in A and B were viewed using the 10X objective (bar represents 100 pro). The micrographs 
shown in C and D, were viewed under the 20X objective (bar represents 25 fxm). The insert 
in panel C is a portion of the section in A and C counter- stained with cresyl violet to visualize 
the neurons, viewed using the 40X objective under bright filed illumination. Note the 
distribution of the silver grains over some, but not all, of the striatal neurons as well as being 
concentrated around clusters of neurons. It appeared that the silver grains were absent from 
fibre tracks within the striatum. It appeared that PDE10A mRNA was not confined to regions 
close to the nucleus but was dispersed in cellular processes. 

Huntingtin with an expanded polyglutamine tract (htt-HD) is expressed in neurons of the 
brain and body throughout development and during the lifetime of HD patients (The 
Huntington's Disease Research Collaborative, 1993; Ross, 1995). Transgenic HD mice 
express a portion of htt-HD and develop a phenotype with many of the symptoms of HD after 
a period of normal development and growth (Carter et al., 1999; Cha et al., 1998; Mangiarini 
et al., 1996). Using differential display RT PCR, northern blot and in situ hybridization, we 
have demonstrated that PDE10A mRNA levels decline in the striatum of HD mice. This 
specific member of the PDE multigene family is highly expressed in the striatum and 
olfactory tubercle of mice (Soderling et al., 1999) and rats (Fujishige et al., 1999) and in the 
caudate and putamen of humans (Fujishige et al., 1999; Loughney et al, 1999). The levels of 
PDE10A were the same in 5 week old wild-type and HD mice. PDE10A mRNA levels then 
began to decline and were almost undetectable in the striatum and olfactory tubercle by the 
time the mice reached 8 weeks of age. This time coincides with the onset of overt motor 
symptoms in the HD mice indicating that the loss of PDE10A in striatal neurons leads to 
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dysfunction of the nuclei that control movement. The R6/2 mice develop the HD phenotype 
in the absence of cell death. The decrease in PDE10A mRNA, therefore, is not due to the 
loss of PDE1 OA-expressing cells but rather a change in steady-state RNA levels that occurs 
due to the expression of mutant huntingtin. 

The particular isoform that decreases in HD is PDE10A. PDE10A has been cloned from 
human lung and fetal brain cDNA libraries (Fujishige et al., 1999; Loughney et al., 1999). It 
appears that the presence of the expanded polyglutamine tract in huntingtin alters gene 
expression in the striatum, and that this is the mechanism by which only a small group of 
neurons in the striatum and cortex are rendered vulnerable to this ubiquitously expressed 
mutant protein. 

EXAMPLE 8 - PDE10A is Highly Conserved Among Mammalian Species 

The oligonucleotide (probe 1) complementary to the coding strand of the PDE10A transcript, 
was also used as an in situ hybridization probe against coronal brain sections derived from 
adult rats. FIG. 8 shows in situ hybridization analysis of adult rat brain sections using 
oligonucleotide probe 1 complementary to the coding- strand of PDE10A revealed that the 
pattern of expression of PDE10A is the same in rats and mice. The hybridization conditions 
used to detect the rat homologue of PDE10A in rat brain tissue differed from those used to 
detect the transcript in mice only in that the stringency of the post-hybridization washes were 
reduced. 

No hybridization was observed in the rat striatum using the post-hybridization washes 
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employed following the in situ hybridization of mouse brain sections. However, when the 
stringency of the post-hybridization washes was lowered (2 x 60' in IX SSC @ 42°C, 2 x 60' 
in 0.5X SSC @ 42°C, 2 x 60' in 0.25X SSC @ room temperature), the PDE10A 
oligonucleotide probe specifically labelled the adult rat striatum, nucleus accumbens and 
olfactory tubercule in a pattern indistinguishable from that observed in mouse brain sections. 
It appears, therefore, that a transcript which shares nucleotide sequence and expression 
pattern is present in both mice and rats. The evolutionary conservation of PDE10A suggests 
that it is important for normal function of the basal ganglia. 

By northern blot, Fujishige et al. (1999) demonstrated that PDE10A is expressed in human 
fetal brain. The homology between mouse and human PDE10A is extremely high (data not 
shown). 

EXAMPLE 9 - Analysis of PDE10A in Genomic DNA 

Because the transgenic mice employed in this study have a copy of the human HD 5' UTR, 
exon 1 with expanded CAG repeat and 262 bp of the intron 1 that has been integrated into an 
undefined locus of the mouse genome, it was possible that the integration event disrupted the 
PDE10A gene preventing its expression in the HD mouse striatum. Genomic DNA was 
isolated from wild-type and HD mice and subjected to Southern blot analysis. 

Genomic DNA was isolated from wild-type and HD mice and subjected to Southern blot 
analysis using pPDElOA as a hybridization probe. The size of the BamEl and EcoKL 
fragments that are present in the transgenic R6/2 line that correspond to the insertion of the 
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human exon 1 gene fragment are 1.9 and 0.8 (BamHl) and 1.9 (EcoRi) kb. Analysis of the 
size of the fragments that hybridized with pPDElOA demonstrated that there was no 
difference in the size of the hybridizing fragments between the wild-type and HD mice. FIG. 
9 shows the genomic DNA restriction fragments that hybridized with pPDElOA were the 
same in wild-type and HD mice. The size of the hybridizing BamHl and EcoRl fragments in 
each genomic DNA sample is approximately 8 kb and 3 kb, respectively. If the 1 .9 kb Sacl- 
EcoTZI HD gene fragment integrated into the genome within the BamHl and EcoRl fragments 
that hybridized with the DHDM cDNA cloned insert, the sizes of the HD hybridizing bands 
would have been distinct from those of the wild-type. This Southern blot analysis indicates 
that the gene encoding PDE10A is present as a single-copy in the mouse genome. The 
numbers at the left of the blot are the relative mobility of molecular weight markers (1 kb 
ladder, BioRad). 

The PDE10A cDNA has since been cloned using a bioinformatics search strategy involving 
screening of the expressed sequence tag (EST) database for novel PDE cDNA clones. 
Independently, the mouse PDE 1 OA cDNA was identified after an EST search for novel PDEs 
with conserved cGMP binding domains (Soderling et al., 1999). The rat isoforms of 
PDE10A and splice variants have also been described (Fujishige et al., 1999). Human, 
mouse and rat PDE10A splice variants differ in their 5' untranslated and part of the 5' coding 
region but are identical in the coding region when the various splice variants are compared 
within each species. The human, mouse and rat PDE10A coding regions contain 779, 779 
and 794 amino acids, respectively, encoding a protein of approximately 88.5 kDa. 
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EXAMPLE 10 - Distribution of PDE10A 

In mouse, PDE10A mRNA was detected in testis and to a much lesser extent in brain but not 
in heart, spleen, lung, liver, skeletal muscle, kidney, ovary, pancreas, smooth muscle, eye or 
in total RNA isolated from 7, 1 1, 15 or 17 day old embryo (Soderling et al., 1999). This data 
agrees with the PDE10A mRNA pattern of distribution that we observed in wild-type and 
pre- symptomatic HD mice. In mice, two different size transcripts are detected in northern 
blots using the coding region as a probe. In mouse testis, the most abundant transcript is 
approximately 4 kb. A 9.5 kb transcript was also detected in mouse testis. It appears that the 
most abundant transcript in mouse brain is 9.5 k. Similarly, two sized PDE10A transcripts 
were observed in rats, however, it appears that, in rat, the 4 kb mRNA is expressed 
exclusively in testis while the 9.5 kb mRNA is expressed exclusively in brain (Fujishige et 
al., 1999). Within the brain, the rat PDE10A mRNA was expressed in striatum and olfactory 
tubercle and not cortex, cerebellum, hippocampus, midbrain or brainstem. In humans, 
PDE10A is expressed in the caudate, putamen and testis. As was observed in rodents, 
mRNAs of approximately 4 and 10 kb hybridized with the PDE10A probe. Again, it appears 
that, although both sized transcripts are present in brain and testis, the larger mRNA is 
predominant in the caudate and putamen and the smaller sized transcript is present in the 
testis. Each of the mouse, rat and human PDE10A sequences are not longer than 4 kb and 
span the coding region and parts of the 3' UTR. The difference in abundance of the short and 
long transcript in the testis and brain, respectively, in all three species suggest that the 3' UTR 
functions to provide transcript stability in the brain. As such, we present the complete 
sequence of the brain-specific transcript of PDE10A derived from mouse. 
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EXAMPLE 1 1 - Modulating Activity of PDE10A Using cGMP-PDE Activity 

Cyclic nucleotides are the predominant second messengers that activate cellular signaling 
pathways (Beavo, 1995; Conti and Jin, 1999). The concentration of intracellular cyclic 
nucleotides is dependent on their rate of synthesis by adenyl and guanyl synthase, the rate of 
efflux from the cell, and the rate of degradation. PDEs hydrolyze cAMP and cGMP limiting 
both the duration and amplitude of the cyclic nucleotide signal (Beavo, 1995; Conti and Jin, 
1999). In mammals, PDEs are encoded by a large multigene family. The various PDE 
family members have tissue-specific patterns of expression (Conti and Jin, 1999). PDEs have 
also been described in Caenorhabditis, Drosophila, Dictyostelium, Saccharomyces, Candida 
and Vibrio species demonstrating that this enzyme has been conserved throughout evolution. 
In mammals, PDEs are encoded by at least 10 gene families, each composed of one or more 
genes. In addition, numerous splice variants of individual gene family members have been 
described. These splice variants alter the 5' domain of the protein but share identical 
nucleotide binding and catalytic domains. The catalytic domain, found in the carboxy- 
terminus of the enzyme, is ~ 275 amino acids and highly conserved in amino acid sequence in 
all PDEs. In total, it appears that there are -50 PDEs expressed within the mammalian body. 
Some PDEs are expressed in multiple tissues while others have a very limited tissue-specific 
distribution (Conti and Jin, 1999). 

PDE gene families differ with respect to their affinity for cAMP and cGMP and their 
dependence on calcium and calmodulin (Beavo, 1995). Moreover, some PDEs are inhibited 
or activated by binding cyclic nucleotides to a non-hydrolytic site. For example, PDE2A has 
a lower for cGMP than cAMP although it hydrolysed both nucleotides. The binding of 
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cGMP to an allosteric activator site within PDE2 enhances the rate of catalysis of c AMP. 
PDE2 is, therefore, a cGMP-stimulated cGMP and cAMP phosphodiesterase (Beavo, 1995). 
Conversely, the affinity of PDE4 for cAMP is much greater than for cGMP and PDE4 
activity is not affected by cGMP or calmodulin (Beavo, 1995). The differences in substrate 
preference, modulation of activity and tissue-specific patterns of expression suggest that 
subtle alterations in the relative levels of cAMP and cGMP mediated through the action of 
various PDEs lead to a wide range of responses to extracellular signals. 

cGMP-PDE activity of compounds is measured using a one-step assay adapted from Wells at 
al. (Wells, J. N., Baird, C. E., Wu, Y. J. and Hardman, J. G., Biochim. Biophys. Acta 384:430 
(1975)) and adopted by Beavo et al, U.S. Patent No. 6,037,1 19. The reaction medium 
contains 50 mM Tris-HCl, pH 7.5, 5 mM Mg-acetate, 250 ug/ml 5'-Nucleotidase, 1 mM 
EGTA and 0.1 5 uM 8-[H 3 ]-cGMP. The enzyme used is a human recombinant PDE V 
(ICOS, Seattle U.S.A.). 

Compounds of interest are dissolved in DMSO finally present at 2% in the assay. The 
incubation time was 30 minutes during which the total substrate conversion did not exceed 
30%. 

The IC 50 values for the compounds examined are determined from concentration-response 
curves using typically concentrations ranging from 10 nM to 10 uM. Tests against other 
PDE enzymes using standard methodology also show compounds highly selective for the 
cGMP specific PDE enzyme. 
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Rat aortic smooth muscle cells (RSMC) are prepared according to Chamley et al. in Cell 
Tissue Res. 177:503-522 (1977) and used between the 10th and 25th passage at confluence in 
24-well culture dishes. Culture media is aspirated and replaced with PBS (0.5 ml) containing 
the compound tested at the appropriate concentration. After 30 minutes at 37° C, particulates 
guanylate cyclase are stimulated by addition of ANF (100 nM) for 10 minutes. At the end of 
incubation, the medium is withdrawn and two extractions were performed by addition of 65% 
ethanol (0.25 ml). The two ethanolic extracts are pooled and evaporated until dryness, using 
a Speed-vat system. c-GMP was measured after acetylation by scintillation proximity 
immunoassay (AMERSHAM). The EC 50 values are expressed as the dose giving half of the 
stimulation at saturating concentrations. 

EXAMPLE 12 - Selected Modulators of PDE10A Activity 

The catalytic domain of PDE10A is most similar in amino acid sequence to PDE5A, PDE2A, 
PDE6B and PDE6A. These members of the PDE family each contain a cGMP binding 
sequence that is not observed in other PDE family members. The non-catalytic cGMP 
binding sites (GAF) domains found in PDE2, 5 and 6 are also found in PDE 10. At least for 
PDE2, this site acts as an allosteric activator for cAMP hydrolytic activity. The GAF domain 
of PDE 1 OA binds other small molecules that act as allosteric activators. PDE 1 OA is a cAMP 
and cAMP -inhibited cGMP PDE (Fujishige et al., 1999; Fujishige et al., 1999; Loughney et 
al., 1999; Soderling et al., 1999). 

Attenuation of the production of cAMP, may ameliorate the symptoms of HD and positively 
affect gene expression. Pharmaceutically acceptable modulators of cAMP include quinpirole, 
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alloxan, miconazole nitrate, MDL-12330A, and tetracyline derivatives such as 
demeclocycline and minocycline. 

Compounds which are potent and selective modulators of cGMP-specific PDE, and are useful 
in a variety of therapeutic areas are taught by Daugan et al, U.S. patent No. 5,981,527, PCT 
publication No. WO 00/15639 to Icos Corporation and PCT publication No. WO 00/15228 to 
Icos Corporation, which are incorporated herein by reference. Such compounds include, for 
example: 

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-berizofuranyl)-2-methyl-pyrazino[2', 
1' : 6, 1 ]pyrido[3 ,4-b]indole- 1 ,4-dione, 

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-pyrazino[2',l':6,l]py rido[3,4- 
]indole-l,4-dione, 

(6R,1 2aR)-2,3 ,6,7,12,1 2a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl-pyrazino[ 
2 , ,l':6,l]pyrido[3,4-b]indole-l,4-dione, 

(3S,6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-3-methyl-pyrazino[ 
2',r:6,l]pyrido[3,4-b]indole-l,4-dione, and 

(3 S,6R, 1 2aR)-2,3,6,7, 1 2, 1 2a-Hexahydro-6-(5-beriZofuranyl)-2,3-dimethyl-pyraz 
ino[2', 1 ' :6, 1 ]pyrido[3 ,4-b]indole- 1 ,4-dione. 

PDE1B1 is expressed throughout the brain and is most abundant in the striatum, nucleus 
accumbens and olfactory tubercle (Polli and Kincaid, 1994; Yan et al., 1994). PDE IB is a 
cGMP, Ca/calmodulin-dependent PDE. Therefore, PDE IB and 10A are both expressed in 
the majority, but not all, striatal neurons and, it is likely that both genes are co-expressed in a 
subset of striatal projection neurons. Selective inhibitors for PDE1 include KS-505, IC224, 
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and SCH 51866. Of these inhibitors, it appears that SCH 51866 has a ten-fold higher Km for 
PDE1 than PDE10 (Soderling et al., 1999). The non-specific PDE inhibitor IBMX is a potent 
inhibitor of PDE10A. Dipyridamole and SCH51866 had the highest potency of inhibitors 
tested on PDE10A activity. Dipyridamole was considered to be a PDE5- and PDE6-specific 
inhibitor, however, the Km for dipyridamole is 10 times higher for PDE10A than the other 
PDEs (Soderling et al., 1999). Selective inhibitors of PDE5, 2, 3 and 4 had much greater 
IC50 for PDE10 (Soderling et al., 1999). 

EXAMPLE 13 - Clinical use of PDE10A Modulator 

A 38 year-old female was admitted to hospital from a long-term care facility due to 
progressive deterioration of her physical and mental symptoms caused by Huntington's 
disease. The patient had been diagnosed with Huntington's disease at age 26. Prior to 
admission to the hospital, she had become increasingly aggressive and uncooperative. 
Moreover, there appeared to be an increase in the number of psychotic episodes. SPECT 
showed no abnormality of brain blood flow but MRI showed bilateral caudate atrophy as well 
as global atrophy of the cerebrum and corpus callosum. 

The patient had been stable for a number of years on the antipsycotic haloperidol (3 mg/day). 
For the last two years, the haloperidol had been replaced by olanzapine (2.5-7.5 mg/day). 

Minocycline, a tetracycline derivative, was administered at 50 mg twice daily for 7 days, 
followed by 100 mg twice daily for 7 days and finally 200 mg twice daily for 5 weeks. After 
5 weeks of 200 mg twice daily minocycline administration, there was a mild improvement 
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compared to the baseline clinical global assessment made at the time of admission. The 
minocycline treatment was suspended for 7 days. Due to a significant increase in the number 
of aggressive incidence and decrease in cooperativity, minocycline (200 mg twice daily) 
treatment was resumed. The patient responded within 3 days to the resumed minocycline- 
treatment with a return to mild-improvement compared to the baseline clinical global 
assessment made at the time of admission. Minocycline (200 mg twice daily) treatment will 
continue indefinitely. The improvement in behaviour and decrease in apparent psychosis has 
allowed for the transfer of the patient from the acute care facility back to long-term care. 

While the present invention has been described in terms of specific embodiments, it is 
understood that variations and modifications will occur to those skilled in the art. 
Accordingly, only such limitations as appear in the appended claims should be placed on the 
invention. 
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Gene Necessary for Striatal Function, Uses Thereof, and 
Compounds for Modulating Same 

Abstract 

PDE10A, a gene that is normally highly expressed in mammalian striatum and elsewhere, has 
been found to decrease in expression during the development of CAG repeat disorders such 
as Huntington's disease. The invention teaches a method for detecting the presence of or the 
predisposition for a CAG repeat disorder. Compounds which modulate CAG repeat disorders 
and their uses are taught. Methods for screening for further compounds to modulate CAG 
repeat disorders are also taught. 
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Figure 3 



5' 11 21 31 41 

- TGTATGGGAATAGTGTTT CCATATGATCTGTTGTCTGGAGTATATGCTAC 
x ACATACCCTTATCACAAA gGTATACTAGACAACAGACCTCATATACGATG 

5' 61 71 81 91 

c ATGTTCATTTACTGTACAAAAACCCAGTGCAGCTGATGATGCAAAGCAGT 
TACAAGf TAAA TGACATGTTT TTGGGTCACG TCGACTAHT A PffTTTrfiTra 

5' 11 21 31 41 

n CTCTCTCTGTGTACAGTGCCCCACCTATTTAAAAATCACGTACTTGCCCA 
x u GAGAGAGACA CATGTCACGG GGTGGATAAA TTTTTAGTGC ATGAACGGGT 

5' 61 71 81 91 

_ GAACACTGTG AAACACTTAA CATAAGAACAAACGCAGCGT CTGGATTCTT 
CTTGTGACAC TTTGTGAATT GTATTCTTGT TTGCGTCGCAGACCTAAGAA 

5' 11 p"*»2. 21 31 41 



2 0 1 TCCAAGGAGAG pAGCTTTCTC^CAGGAACACAGTAAC^AAGAG^ Tr'r'G 



AGGTTCCTCT CGTCGAAAGAGGTGTCCTTG TGT CATTGTT TTCTCCAGGC 

5 ' 61 71 81 91 

CCGCCATCCACACCCAGCCAAGACACCTCAGAGGCCATAGGGACAACCTC 
GGCGGTAGGT GTGGGTCGGT TCTGTGGAGT CTCCGGTATC CCTGTTGGAG 

5' 11 21 31 41 

_ n -■ CTTGCTGGCC AACACCTGCTGGAGCAGGGG CACAGGTCCC AGCAACTGAT 
GAACGACCGGTTGTGGACGACCTCGTCCCCGTGTCCAGGGTCGTTGACTA 

5' 61 71 81 91 

_ CCTCAGTGGATGGGTCTGCAGCCAAAGCCTTAATGGGCTCTCTTTTGAA.G 
J ^ X GGAGTCACCT ACCCAGACGT CGGTTTCGGAATTACCCGAG AGAAAACTTC 

5« 11 21 31 41 

. GGGAAAGAAA GAATTTCAAG CTTATGATAT CCAATATTAT TATAGTTGAT 
CCCTTTCTTT CTTAAAGTTC GAATACTATA GGTTATAATA ATATCAACTA 

5' 61 71 81 91 

GAGTTAGTAAATTCCAAAAAAAAAA 
b CTCAATCATT TAAGGTTTTT TTTTT 
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Figure 10 



51 11 21 31 41 

CACTGAAGCT GGTCCACGTC TATAAACAGG TGACACTGGC TGCAGCAAAA 
1 GTGACTTCGA C CAGGTGCAG ATATTTGTC C ACTGTGACCG ACGTCGTTTT 

5 « 61 71 81 91 

AGCCATTCGATCCACACAAATTGATCTTCTATCATCTTGGAATCTGAATT 
5 1 TCGGTAAGCTAGGTGTGTTTAACTAGAAGATAGTAGAACCTTAGACTTAA 

5. 11 21 31 41 

GCAGGGAGGAGCAGTATGTAAGACGACCGT TTAATTCAGG CATTCCGAAG 
101 CGTC CCTCCT CGTCATACAT TCTGCTGGCA AATTAAGTCC GTAAGGCTTC 

5 1 61 71 81 91 

G GATGAGCGC ATGGATTCTG TCAC CAAGCG TATAAAAGGA CCCTGGCATT 
151 CGTAC TCGCG TACCTAAGAC AGTGGTTCGC ATATTTTCCT GGGACCGTAA 

5. 11 21 31 41 

GGGAAACCTATGACGGACTGTTTTTGCTGTAGAAGTAGGGATTTTACAGA 
201 CCCTTTGGAT ACTGCCTGAC AAAAACGACATCTTCATCCC TAAAATGTCT 

51 61 71 81 91 

AGTCTCCTTGAATTTGCCCTGCCTGGGGCAGTTTTGCAGAGGAACCTGCC 
251 TCAGAGGAAC TTAAACGGGACGGACCCCGT CAAAACGTCT CCTTGGACGG 

51 11 21 31 41 

AGAGATTTAT TGGCTGGTCAGTCTCTTGTG AAATAGTATC ATGTGAGAAA 
301 TCTCTAAATAACCGACCAGT CAGAGAACAC TTTATCATAG TACACTCTTT 

51 61 71 81 91 

CAGTTTGTAG AAAAAAACTA TAC CTGGGAAGACCTTTGCAACATTGTTCC 
351 GT cAAACATC TTTTTTTGAT ATGGACCCTT CTGGAAACGT TGTAACAAGG 

51 11 21 31 41 

TTCCATGGGC CAAGACTCAG TTAGGAGGCATAAATCTGCC CGGAATAAAC 
401 AAGGTACCCGGTTCTGAGTCAATCCTCCGTATTTAGACGGGCCTTATTTG 

51 61 71 81 91 

TAGGCCAGGATACAGCCATG TTTAGTTAAT AATTTGGTTT TAGAATTCAC 
451 ATCCGGTCCTATGTCGGTACAAATCAATTATTAAACCAAAATCTTAAGTG 

51 11 21 31 41 

ACAGGCAGGATTGGTTTTTT TGTGTCTTGG CAAGTGGAGC ATATTTAACA 
501 TGTCC GTCCT AACCAAAAAAACACAGAACC GTTCACCTCG TATAAATTGT 

5. 61 71 81 91 

TACAGGCATG GGAATCCTGC CTCTTAGCTT TTCCCACCCT CTTGTCTCAC 
551 ATG TCCGTAC CCTTAGGACG GAGAATCGAAAAGGGTGGGA GAACAGAGTG 

51 11 21 31 41 

CAAGTTTTTT CTCTC CAAAG GTTTCCAGGAATTTCTCATT AATGGCTGAT 
601 GTTCAAAAAAGAGAGGTTTCCAAAGGTCCTTAAAGAGTAATTACCGACTA 
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Figure 10 continued 



5' 61 71 81 91 

GCAAACTTAG TGAAT AATAA TGAATATAAA CAATGCTCAC CTCACCAAAA 
bi3± CGTTTGAATC ACTTATTATT ACTTATATTT GTTACGAGTG GAGTGGTTTT 

5 r 11 21 31 41 

TTATATTATT TGCAGTCATT TGTGATAACA GAAATTTTAT CGCAATGGTT 
701 AATATAATAA ACGTCAGTAA ACACTATTGT GTTTAAAATA GCGTTACCAA 

5' 61 71 81 91 

ATTATTTAAT TTGTGGC CAC ACACTGTGGT TATCTTTTGT TGTGGTTGTT 
751 TAATAAATTAAACACCGGTGTGTGACACCAATAGAAAACAACACCAACAA 

5 1 11 21 31 41 

TCTGAGAAAA TGTTCTTGGA TATGTAAGTG CCAATACCAG TGTGAAGTAT 
8 AGACTCTTTT ACAAGAACCT ATACATTCAC GGTTATGGTC ACACTTCATA 

5 ' 61 71 81 91 

TGATCCCGGG CAGCAAAATA CAGCCTAAGG TTTGTAAACATCAATTCTAT 
8 b ACTAGGGCCC GT CGTTTTAT GTCGGATTCC AAACATTTGT AGTTAAGATA 

5' 11 21 31 41 

oni CTCAGTTCAT CAGAGGGCCT GAGAAGCTGC GGGGCAGTGTAAAGTAAAGT 
901 GAGTCAAGTAGTCTCCCGGACTCTTCGACGCCCCGTCACATTTCATTTCA 

5 1 61 71 81 91 

ATGCTGGGCTGGTGGTGGTC AGCCTCCCGC CTGAAGAGTGACCAGTGCTG 
951 TACGACCCGA CCACCACCAG TCGGAGGGCG GACTTCTCAC TGGTCACGAC 

5" 11 21 31 41 

- nn _ GCCCGACGGATCGCTGAGATATTCTCCCATAATGGCAAAAAAATAGGCAG 
1001 CGGGCTGCCT AGCGACTCTA TAAGAGGGTATTACCGTTTT TTTATCCGTC 

5' 61 71 81 91 

_ nc _ TTTGATGTGACCTGTTTAGT GTGGCTCTCC TCTTTTGAGC ATGTGTTAGC 
AAACTACACT GGACAAATCA CACCGAGAGG AGAAAACTCG TACAGAATCG 

5' 11 21 31 41 

ATTTTTATTT TATACTCATC CAGTGAACTC TGCTCTTCGA AGTGTGTTCA 
1101 TAAAAATAAAATATGAGTAG GTCACTTGAG ACGAGAAGGT TGA.CACAAGT 

5' 61 71 81 91 

_ n TGTATGTGCTAGATATATTAGCACAGCCTG CCTTCTGCTG CACAACGCCT 
ACATACACGA TCTATATAAT CGTGTCGGAC GGAAGACGAC GTGTTGCGGA 

5' 11 21 31 41 

_ on _ TAGAGACCCG GCCTTTCAAT GAGCTTAGCT TGTGCTCTGT TTCTGCTCTC 
12 01 jytcTCTGGGC CGGAAAGTTA CTCGAATCGAACACGAGACA AAGACGAGAG 

5 ' 61 71 81 91 

_ _ j. _ TTAGGTCTAA ACTATGGTGT CAGTTTTAAT AGAACAAAAG TATGCATCTT 
LZt>L AATCCAGATTTGATACCACAGTCAAAATTATCTTGTTTTC ATACGTAGAA 
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Figure 10 continued 



5' 11 21 31 41 

GCCTTGGCTT GAGCCTTTTC GTTTTCAATG CTGACTTCTC CCCTTTCTCT 
1J 01 CGGAACCGAA CTCGGAAAAG CAAAAGTTAC GACTGAAGAG GGGAi' AGAGA 

5' 61 71 81 91 

CCTGTGCTCA CCTTACCTTT CCAGAGTGTA AGGGACAACT TTTAAGGAGG 
1 3 b 1 GGACACGAGT GGAATGGAAA GGTCTCACAT TCCCTGTTGAAAATTCCTCC 

5' 11 21 31 41 

CGTGTCCCTG GTAGGGGCAT CCCTGTTCAC CAGGTGCCTG TCATCACCCC 
1401 GCACAGGGAC CATCCCCGTA GGGACAAGTG GTCCACGGAC AGTAGTGGGG 

5' 61 71 81 91 

AI .- ACTTGACTGACATCTACCCT GGTGACTATG GGTTCCTCTT GTTTGTAGGG 
1451 TGAACTGACT GTAGATGGGA CCACTGATAC CCAAGGAGAA CAAACATCCC 

5' 11 21 31 41 

_ crn AACGGTGGCT CCAGGTGGAG GCATCAATCT GTTGGGTTCT GGTTCCCGGC 
1501 TTGCCACCGAGGTCCACCTC CGTAGTTAGA CAACCCAAGA CCAAGGGC CG 

5' 61 71 81 91 

TGCCTTTGGTTTTGAAAGTC TCTTCTCTGTATATTCCTAC CCTGCATTTG 
15 ACGGAAACCAAAACTTTCAG AGAAGAGACATATAAGGATGGGACGTAAAC 

5' 11 21 31 41 

_ CTTTGTGTGGTGCTGATGCTGTGCGCAGTAGGATTCTTGGATGACTCTCC 
GAAACACACC ACGACTACGA CACGCGTCAT CCTAAGAACC TACTGAGAGG 

5 r 61 71 81 91 

^ ATCAGTCACA GACTC CCCCT GTTGCAAAGT GTCAGGCTGA CTCGACAGTC 
1651 TA G T CAGTGT CTGAGGGGGA CAACGTTTCA CAGTCCGACT GAGCTGTCAG 

5' 11 21 31 41 

n _ ACCGTAAAAT CTGAGTCAGT CACACACAGG CTGTCAGCCACGGCTTCCAC 
1701 TGGCATTTTAGACTCAGTCAGTGTGTGTCC GACAGTCGGT GCCGAAGGTG 

5' 61 71 81 91 

TTGCATGGCT ATTCTATTTT GA.CACGTGAG TTTCTGTTGC TGGCTGGCTG 
171 AACGTACCGATAAGATAAAAGTGTGCACTC AAAGACAACG ACCGACCGAC 

5' 11 21 31 41 

ACTGGCATTATCTATGCTAAGTTGAAATCAGGAGTGCCCAGCAGAGCCCA 
18 01 TGAC CGTAAT AGATACGATT CAACTTTAGT CCTCACGGGT CGTCTCGGGT 

5 ' 61 71 81 91 

TCATTCTCAC TGTCTTTGAA ACAAAGCTGT ACGGTTTGAT CGATGAACGT 
1 8 b 1 AGTAAGAGTG ACAGAAACTT TGTTTCGAGA. TGCCAAACTAGCTACTTGCA 

5' 11 21 31 41 

ATTTAAAGCATTTCATGCAA TGACAAAGTG CTCAGTAGTGGAAGGCAGGC 
X y TAAATTTCGT AAAGTACGTT ACTGTTTCAC GAGTCATCAC CTTCCGTCCG 
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Figure 10 continued 



5* 61 71 81 91 

_ . c _ TGTGACCAGT CTGCCTGCTC CTTACTATAA TTGTGAGGAT TTGTTACTGG 
1951 ACACTGGTCAGACGGACGAG GAATGATATT AACACTCCTAAACAATGACC 

5' 11 21 31 41 

AACAGTACAT GGAGGCCTGA CCTTGTGGGG GCACAGGGTG GAACCTTAGC 
2001 TTGTCAT gtA CCTCCGGACT GGAACACCCC CGTGTCCCAC CTTGGAATCG 

5« 61 71 81 91 

_ rt c - TGAATATAGT GTGTGTCTCA AGAGGAAGTC AGGGTACTAG CTCAGTGCTC 
2051 ACTTATATCACACACAGAGTTCTCCTTCAGTCCCATGATCGAGTCACGAG 

5' 11 21 31 41 

AATCTCCAGG TACTATATAT ACATTTGCCC GTTTTATCTC TAATGTGAAA 
2101 ■pTAGAGGTCCATGATATATATGTAAACGGG CAAAATAGAGATTACACTTT 

5* 61 71 81 91 

0 _ TAAATCCCCAAACACTTGTT TATCGTGTAG CGTACCTAAAAGACTATTCT 
2151 ATTTA QGGGT TTGTGAACAAATAGCACATC GCATGGATTT TCTGATAAGA 

5« 11 21 31 41 

0 0 _ _ ATTATGGGTG TCCCCACTTT CTTGGTTTGGTCACCCCGAT CCCCCGGTCT 
2201 TAATACCCACAGGGGTGAAAGAACCAAACCAGTGGGGCTAGGGGGCCAGA 

5« 61 71 81 91 

0 „ c _ TCTGCTGTAT CTAGAACAGT GACTATAAAT GATGTATGGG AATAGTGTTT 
2 AGACGACATAGATCTTGTCA CTGATATTTA CTACATACCC TTATCACAAA 

5' 11 21 31 41 

0 _ n _ CCATATGATC TGTTGTCTGG AGTATATGCTACATGTTCAATTACTGTACA 
2301 GGTATACTAGACAACAGACCTCATATACGATGTACAAGTTAATGACATGT 

5' 61 71 81 91 

0 . C1 AAAACCCAGTGCAGCTGATGATGCAAAGCAGTCTCTCTCTGTGTACAGTG 
2351 <p<pTTGGGTCA CGTCGACTAC TACGTTTCGT CAGAGAGAGA CACATGTCAC 

5' 11 21 31 41 

_ . n _ CCCCACCTAT TTAAAAATCA CGTACAAS CC CAGAACACTG TGAAACACTT 
GGGGTGGATA AATTTTTAGT GCATGTTSGG GTCTTGTGAC ACTTTGTGAA 

5' 61 71 81 91 

„ . c _ AACATAAGAA CAAACGCAGC GTCTGGATTC TTTCCAAGGAGAGCAGCTTT 
2451 TT GTATTCTT GTTTGCGTCG CAGACCTAAG AAAGGTTCCT CTCGTCGAAA 

5 1 11 21 31 41 

CTCCACAGGAACACAGTAAC AAAAGAGGTC CGCCGCCATC CACACCCAGC 
2501 GAGGTGTCCT TGTGTCATTG TTTTCTC CAG GCGGCGGTAG GTGTGGGTCG 

5 1 61 71 81 91 

CAAGACACCT CAGAGGCCAT AGGGACAACC TCCTTGCTGG CCAACACCTG 
2. b b J. GTTCTGTGGAGTC TCCGGTATCCCTGTTGG AGGAACGACC GGTTGTGGAC 
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Figure 10 continued 



5' 11 21 31 41 

CTGGAGCAGG GGCACAGGTC CCAGCAACTG ATCCTCAGTG GATGGGTCCG 
^ b UX GACCTCGTCC CCGTGTCCAG GGTCGTTGAC TAGGAGTCAC CTACCCAGGC 

5' 61 71 81 91 

CAGTCAAAGC CTTAATGGGC TCTCTTTTGAAGGGGAAAGA AAGAATTTCA 
2. b b ± GTCAGTTTCGGAATTACCCGAGAGAAAACTTCCCCTTTCTTTCTTAAAGT 

5' 11 21 31 41 

AGCTTATGATATCCAACATTATTATAGTTGATGAGTTAGTAAATTCCAAA 
^ /Ui TCGAATACTA TAGGTTGTAA TAATATCAAC TACTCAATCA TTTAAGGTTT 

5« 61 71 81 91 

!: AAAAAAAGAT GATTTTATAT GTATGAGATA AAAAAAATCT TTGTAAAGTG 

J Z 1 b - 1 TTTTTTTCTA CTAAAATATA CATACTGTAT TTTTTTTAGAAACATTTCAC 

5 1 11 21 31 41 

I OQni CGCAAGTGCAATAATTTAAA GAGGTCTTAT CTTTGCATTT ATAAATTATA 
j Z 3 ° 1 GCGTTCACGT TATTAAATTT CTCCAGAATAGAAACGTAAATATTTAATAT 

5 1 61 71 81 91 

' AATATTGTACATGTGTGTAATTTTTCATGTATTCATTTGCAGTCTTTGTA 
Z b b 1 TTATAACATG TACACACATT AAAAAGTACA TAAGTAAACG TCAGAAACAT 

[5' 11 21 31 41 

■ OQn . TTTAAAAAAA CTTTACTGTT ATGTTTGTAT AATAGAACAT TAATCATTTA 

! Z y U L AAATTTTTTT GAAATGACAATACAAACATATTATCTTGTAATTAGTAAAT 

5 « 61 71 81 91 

_ Q _ TTATAACTCAGACAAGGTGT AAATAAATTC ATAATTCAAA CAGCCAGTAT 
AATATTGAGTCTGTTCCACATTTATTTAAGTATTAAGTTTGTCGGTCATA 

5' 11 21 31 41 

n ATATGCATAT ATGGGTGTTA CATTGCAAAAATCTCTATCT TTGTTCTATT 
6 U Ul TATA CGTATATACCCACAATGTAACGTTTTTAGAGATAGAAAC!AAGATAA 

5' 61 71 81 91 

CACATGCTTAAAGAAGTAAG AAATCTTTTG TGGATATGTAATTATACATA 
GTGTACGAAT TTCTTCATTC TTTAGAAAAC ACCTATACAT TAATATGTAT 

5' 11 21 31 41 

TAAAGTATATATATATGTATGATACATGAAATATATTTAGAAATGTTCAT 
ATTTCATATATATATACATA CTATGTACTT TATATAAATC TTTAGAAGTA 

5' 61 71 81 91 

AATTTTAATGGATATTCTTTGGTGTGAATAATTGAATACAACATTTTTAA 
6 ±bl TTAAAATTAC CTATAAGAAA C CACACTTAT TAACTTATGT TGTAAAAATT 

5' 11 21 31 41 

-. 9ni AATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
TTACTTTTTTTTTTTTTTTT TTTTTTTTTT TTTTTT 
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Figure 12 



5' 11 21 31 41 

1 AAGTGTAAATAAAATAAACATCTAATAAAAAAAATTACATACCATAGAGG 
TTCACATTTA TTTTATTTGT AGATTATTTT TTTTAATGTATGGTATCTCC 

5» 61 71 81 91 

„ AACAAGATAATTTCTGCCCAACTTCATACCCTCCAGCGTATAGTGTTGAG 
TTGTTCTATT AAAG ACGGGT TGAAGTATGG GAGGTCGCAT ATCACAACTC 

5' 11 21 31 41 

GTTTGGTCTG TTGCTGTGTATTGTAATGTA ATGTTAAATT CTCTACCTGA 
X U - 1 - CAAACCAGAC AACGACACATAACATTACATTACAATTTAAGAGATGGACT 

5' 61 71 81 91 

- q - AGGTCTAGGC CTACAAGTGAATTCTCATGTTTATAGAGTT TTGTTGTGCA 
TCCAGATCCG GATGTTCACT TAAGAGTACAAATATCTCAAAACAACACGT 

5' 11 21 31 41 

9ni AACCTTGTTC CTTAATTTAAAACTATGGTT AAAAAACAAAACAAAACTGG 
TTGGAACAAGGAATTAAATTTTGATACCAATTTTTTGTTT TGTTTTGACC 

5 ' 61 71 81 91 

9 1- n CTACAGCCAATAACTGAAGG GGGTTACCTT GTTGAAGGGG TGGAAAAGAG 
z 3 x GATGTCGGTTATTGACTTCC CCCAATGGAACAACTTCCCC ACCTTTTCTC 

5' 11 21 31 41 

_, ni AGAGGAGGAA GAAGGGAGTT CAAGAGAAGG AGAAGAACAA GAGGAGAGGA 
TCTCCTCCTT CTTCCCTCAAGTTCTCTTCC TCTTCTTGTT CTCCTCTCCT 

5 r 61 71 81 91 

_ t- - GGAAGCTGCCACGAGGGGAG ATGGGCCATG AGAACTTGGC CAGGAGAAAT 
■ - it3X CCTTCGACGG TGCTCCCCTC TACCCGGTAC TCTTGAACCG GTCCTCTTTA 

5' 11 21 31 41 

. n - AGCCAGTATC TGGAGTACAC CACTGAGGAG GTAGCCAGGC TAGCAGTTAG 
TCGGTCATAG ACCTCATGTG GTGACTCCTC CATCGGTCCG ATCGTCAATC 

5' 61 71 81 91 

. t- _ AAGAGTAGAT TAGGGGTTAT TTTTCCCCCA CTCCACATAG TTATCAAAGC 
* r> j. TTCT ck TCTAATCCCC ^TAAAAAGG^ 

5' 11 21 31 41 

^ „ CAAATAAAATAACCATAGTC TGAGTCTCAT CTATTTGTAA GCTAGTTGGG 
GTTTATTTTATTGGTATCAGACTCAGAGTAGATAAACATT CGATCAACCC 

5' 61 71 81 91 

t-r, TATAAGATTA ATTTGGCTGT ACTACAGTTT AGATTTCTAA CATAGGAACT 
3 x ATATTCTAAT TAAACCGACATGATGTCAAATCTAAAGATT GTATCCTTGA 

5' 11 21 31 41 

G01 ATCAAAAACT TGCTCAAACA. AGAACATGCT GACAATATTT TAAAATGATT 
TAGTTTTTGA ACGAGTTTGT TCTTGTACGA CTGTTATAAAATTTTACTAA 



16/41 



Figure 12 continued 



5- 61 71 81 91 

ATTTATATTG TTTGCACTTT CTAAAGTTTC TTCTAAATGT TCCATGGTCA 
651 TAAATATAAC AAACGTGAAAGATTTCAAAGAAGATTTACAAGGTACCAGT 

5< 11 21 31 41 

AATTAAAAAATATACATATTGGCTATTAAATTCGTCTAAGTGGGGCTGGA 
701 TTAATTT TTTATATGTATAACCGATAATTT AAGCAGATTC ACCCCGACCT 

5 . 61 71 81 91 

GAGATAGCTCAGAGGTTAAGAGCACTGACTGCTCTTCCAGAGGTCCTGAG 
751 CTCTATCGAG TCTCCAATTC TCGTGACTGA CGAGAAGGTC TCCAGGACTC 

5- 11 21 31 41 

TTCAATTCCC AGCGACCACATGGTGGCTCACAGCCATCTG TAATAGATAG 
801 AAGTTAAGGG TCGCTGGTGT ACCACCGAGT GTCGGTAGAC ATTATCTATC 

5 . 61 71 81 91 

GATCTGACGC CCTCTTCTGGAGTGTCTGAAGACAGCTACAATGTACTCAT 
851 CTAGACTGCG GGAGAAGACC TCACAGACTT CTGTCGATGT TACATGAGTA 

51 11 21 31 41 

ATATATTAAATAAATAATATTAGAAAATTCTTCTAAGTGTATCATTTATA 
901 ■pj^TATAATTTATTTATTATAATCTTTTAAGAAGATTCACATAGTAAATAT 

51 61 71 81 91 

GAATATTTAATATATAAAGTAAATGCCTCAGGAAATATAAACTTGGAATT 
951 CTTATA AATT ATATATTTCATTTACGGAGT CCTTTATATT TGAACCTTAA 

5 t 11 21 31 41 

AAATCAAAGAACTTCATGAG TAGTGGGCCA CAAAAAATGT GTACCAGGGG 
1001 -pTTAGTTTCT TGAAGTACTC ATCACCCGGT GTTTTTTACA CATGGTCCCC 

5. 61 71 81 91 

AAGACCGGAGGGAGGGGAGAAGGAAGGGATGGAGATAGAATTTTGCCTCT 
1051 «£TCTGGCCTC CCTCCCCTCT TCCTTCCCTACCTCTATCTT AAAACGGAGA 

51 11 21 31 41 

GCATTCCTTGGGCTGGCACAGGTATAATGCTGTGGGAATTGGGAAACTAC 
1101 CGTAAG GAAC CCGACCGTGT CCATATTACG ACACCCTTAA CGCTTTGATG 

5. 61 71 81 91 

AAGGAAGCTG CAAAGCTGGG CGGAACTCGT TTCCGCAAGC TGGGCTCATC 
1151 rprpccTTCGAC GTTTCGACCC GCCTTGAGCA AAGGCGTTCG ACCCGAGTAG 

51 11 21 31 41 

TAAGTGTCGA.TGCATGGCTG CCACACTGCA GTGAACTTTAAAACATTTGT 
1201 ATTC ACAGGT ACGTACCGACGGTGTGACGT CACTTGAAAT TTTGTAAACA 

5 t 61 71 81 91 

GTTCCAGAGATGTAGAGATG CTCACAATAG TACAAAGGCG GGAGGGAGGT 
1251 CAAGGTCTCT ACATCTCTACGAGTGTTATCATGTTTCCGC CCTCCCTCCA 
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Figure 12 continued 



5» 11 21 31 41 

_ n ATTTCCAGAC TAAGAGGAAG AAAAACCATT GCTGATTAAA CATCTGCATA 
TAAAGGTCTG ATTCTCCTTC TTTTTGGTAA CGACTAATTT GTAGACGTAT 

5' 61 71 81 91 

TGAGCGCCCCC^CCTCCATA(^CACACACACAC^ 

ACTCGCGGGG GTGGAGGTAT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 

5' 11 21 31 41 

CAACCAAACA GAACAAATAC ACATGCATGT CTACAGCCTG CAGGAACAAA 
x * u x GTTGGTTTGT CTTGTTTATG TGTACGTACAGATGTCGGAC GTCCTTGTTT 



5" 61 71 81 91 

ATGGTATGTC TGTGAGGAAC CAGGAGATGC AGAGGTCCTAACCTCTGTCT 
TACCATACAG ACACTCCTTG GTCCTCTACG TGTCCAGGAT TGGAGACAGA 



5' 11 21 31 41 

-em CCTACAAGCC CTGAAGTCTG GTCAGGGTCA AATGTACAAAAGCAGGCTAA 
GGATGTTCGGGACTTCAGACCAGTCCCAGTTTACATGTTTTCGTCCGATT 

5' 61 71 81 91 

GGAAGCTGTT TAGTGAAAGA TTTTTTTCTT CAACTCTAGGAACAACCTAT 
X *^ X CCTTCGACAAATCACTTTCT AAAAAAAGAAGTTGAGATCC TTGTTGGATA 

5' 11 21 31 41 

- TTCCTAGGAT TTGGAGAGTG CTCAGGAGGA AACATTCAGA CAACTGATGC 
AAGGATCCTAAACCTCTCACGAGTCCTCCTTTGTAAGTCTGTTGACTACG 

5 1 61 71 81 91 

fit - n TCTCTGTGTACCCCAGATTCAGGTATTGGGGTAGTTAGTTGTGCTCATGT 
■ LbiD± AGAGACACAT GGGGTCTAAG TCCATAACCC CATCAATCAA CACGAGTACA 

5' 11 21 31 41 

„ n ATGTGCTAGATATATTAGCACAGCCTGCCTTCTGCTGCACAACGCCTTAG 
± /UJ. TACAC GATCTATATAATCGTGTCGGACGGAAGACGACGTGTTGCGGAATC 

5 1 61 71 81 91 

_ nt .- AGACCCGGCC TTTGAATGAG CTTAGCTTGT GCTCTGTTTC TGCTCTCTTA 
TCTGGGCCGG AAAGTTACTC GAATCGAAGA CGAGACAAAG ACGAGAGAAT 

5' 11 21 31 41 

fin GGTCTAAACTATGGTGTCAGTTTTAATAGAACAAAAGTATGCATCTTGCC 
XtiUX CCAGATTTGA TACCACAGTC AAAATTATCT TGTTTTCATA CGTAGAACGG 

5' 61 71 81 91 

TTGGCTTGAG CCTTTTCGTT TTCAATGCTGACTTCTCCCCTTTCTCTCCT 
X a * X AACCGAACTC GGAAAAGCAAAAGTTACGAC TGAAGAGGGG AAAGAGAGGA 

5« 11 21 31 41 

qm GTGCTCACCTTACCTTTCCAGAGTGTAAGGGACAACTTTTAAGGAGGCGT 
CACGAGTGGAATGGAAAGGT CTCACATTCC CTGTTGAAAA TTCCTCCGCA 
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5' 61 71 81 91 

_ Qt __ GTCCCTGGTAGGGGCATCCCTGTTCACCAGGTGCCTGTCATCACCCCACT 
1 y b X CAGGGACCAT CCCCGTAGGG ACAAGTGGTC CACGGACAGT AGTGGGGTGA 

5* 11 21 31 41 

_ _ _ . TGACTGACAT CTACCCTGGT GACTATGGGT TCCTCTTGTT TGTAGGGAAC 
20 ACTGACTGTA GATGGGACCA CTGATAC CCA AGGAGAACAA ACATC CCTTG 

5< 61 71 81 91 

GGTGGCTCCAGGTGGAGGCATCAATCTGTTGGGTTCTGGTTCCCGGCTGC 
2051 CCACCGAGGTCCACCTCCGTAGTTAGACAACCCAAGACCAAGGGCCGACG 

5' 11 21 31 41 

CTTTGGTTTT GAAAGTCTCT TCTCTGTATATTCCTACCCT GCATTTGCTT 
3 2101 GAAACCAAAACTTTCAGAGAAGAGACATATAAGGATGGGACGTAAACGAA 

n 5 1 61 71 81 91 

3 TGTGTGGTGC TGATGCTGTG CGCAGCAGGATTCTTGGATGACTCTCCATC 

3 2151 ACACACCACGACTACGACACGCGTCGTCCTAAGAACCTAC TGAGAGGTAG 

3 5' 11 21 31 41 

0 AGTCACAGAC TCCCCCTGTT GCAAAGTGTC AGGCTGACTC GACAGTCACC 
22 01 TCAGTGTCTGAGGGGGACAACGTTTCACAG TCCGACTGAG CTGTCAGTGG 

3 5' 61 71 81 91 

1 GTAAAATCTG AGTCAGTCAC ACACAGGCTG TCAGCCACGG CTTCCACTTG 
i CATTTTAGAC TCAGTCAGTG TGTGTCCGAC AGTCGGTGCC GAAGGTGAAC 

% 5 « 11 21 31 41 

CATGGCTATT CTATTTTCAC ACGTGAGTTT CTGTTGCTGG CTGGCTGACT 
2301 QTACCGATAAGATAAAAGTG TGCACTCAAA GACAACGACC GACCGACTGA 

5' 61 71 81 91 

GGCATTATCT ATGCTAAGTT GAAATCAGGG GTGCCCAGCAGAGCCCATCA 
2351 CCGTAATAGATACGATTCAA CTTTAGTCCC CACGGGTCGT CTCGGGTAGT 

5' 11 21 31 41 

TTCTCACTGT CTTTGAAACAAAGCTGTACG GTTTGATCGA TGAACGTATT 
2401 AAGAGTGACAGAAACTTTGT TTCGACATGC GAAACTAGCTACTTGCATAA 

5' 61 71 81 91 

TAAAGCATTT CATGCAATGA CAAAGTGCTC AGTAGTGGAA GGCAGGCTGT 
2451 ^i>TTCGTAAA GTACGTTACT GTTTCACGAG TCATCAC CTT CCGTCCGACA 

5' 11 21 31 41 

GACCAGTCTG CCTGCTCCTT ACTATAATTG TGAGGATTTG TTACTGGAAC 
2501 CTGGTCAGACGGAC GAGGAATGATATTAAC ACTCCTAAACAATGACCTTG 

5' 61 71 81 91 

_ c c - AGTACATGGAGGCCTGACCT TGTGGGGGCA CAGGGTGG AA C CTTAGCTG A 
2551 TCATGTACCT CCGGACTGGAACACCCCCGT GTCCCACCTT GGAATCGACT 
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5' 11 21 31 41 

2601 A TATAGTGTG TGTCTCAAGAGGAAGTCAGG GTACTAGCTC AGTGCTCAAT 
TATATCACAC ACAGAGTTCT CCTTCAGTCC CATGATCGAG TCACGAGTTA 

51 61 71 81 91 

2651 CTCCAGGTAC TATATATACATTTGCCCGTT TTATCTCTAATGTGAAATAA 
GAGGTCCATGATATATATGTAAACGGGCAAAATAGAGATTACACTTTATT 

5 ' 11 21 31 41 

2701 ATCCCCAAAC ACTTGTTTAT CGTGTAGCGT ACCTAAAAGA CTATTCTATT 
TAGGGGTTTG TGAACAAATA GCACATCGCA TGGATTTTCT GATAAGATAA 

5' 61 71 81 91 

2751 ATGGGTGTCC CCACTTTCTTGGTTTGGTCACCCCGATCCC CCGGTCTTCT 
TACCCACAGG GGTGAAAGAA C CAAACCAGT GGGGCTAGGG GGCCAGAAGA 



0 5 ' 11 21 31 41 

%801 GCTG ^ TCmG ^ c ^TGACTATAAATGATGTATGGGAATAGTGTTTCCA 
B CGACATAGATCTTGTCACTGATATTTACTACATACCCTTATCACAAAGGT 

■5' 61 71 81 91 

^2 851 TATGATCTGT TGTCTGGAGT ATATGCTACA TGTTCATTTA CTGTACAAAA 

m ATACTAGACAACAGACCTCATATACGATGTACAAGTAAATGACATGTTTT 

5' 11 21 31 41 

?2 9 0 1 A CCCAGTGCAGCTGATGATG CAAAGCAGTC TCTCTCTGTG TACAGTGCCC 

1 J TGGGTCACGT CGACTACTAC GTTTCGTCAG AGAGAGACAC ATGTCACGGG 

□ 5' 61 71 81 91 

1951 ( ^ GGT ^TTTAAAAATCACGTACTTGCCCAGAACACTGTGAAACACTTAAC 
GTGGATAAATTTTTAGTGCATGAACGGGTC TTGTGACACT TTGTGAATTG 

5 ' 11 21 31 41 

3001 -^■TAAGAACAAACGCAGCGTC TGGATTCTTT CCAAGGAGAG CAGCTTTCTC 
TATTCTTGTT TGCGTCGCAG ACCTAAGAAA GGTTCCTCTC GTCGAAAGAG 

5' 61 71 81 91 

3 051 ^^GGAAC^CAGTAACIAAAAGAGGTCCGCCGCCATCCACACCCAGCCAA 
GTGTCCTTGT GTCATTGTTT TCTCCAGGCG GCGGTAGGTG TGGGTCGGTT 

5' 11 21 31 41 

3101 GACACCTCAGA GGCCATAGGGACAACCTCCTTGCTGGCCAACACCTGCTG 
CTGTGGAGTC TCCGGTATCC CTGTTGGAGG AACGACCGGT TGTGGACGAC 

5 ' 61 71 81 91 

3 151 GAG ^ GGGGCACAGG TCCCAGCAACTGATCCTCAGTGGATGGGTCTGCAG 
CTCGTCCCCG TGTCCAGGGT CGTTGACTAG GAGTCACCTA CCCAGACGTC 

5 ' 11 21 31 41 

32 01 CCAAAGCG TTAATGGGCTCT CTTTTGAAGG GGAAAGAAAG AATTTCAAGC 
GGTTTCGGAATTACCCGAGAGAAAACTTCC CCTTTCTTTC TTAAAGTTCG 
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5' 61 71 81 91 

_ „ t- - TTATGATATC CAATATTATTATAGTTGATGAGTTAGTAAATTCGAAAAAA 

3 J b AATACTATAG GTTATAATAA TATCAACTAC TGAATCATTTAAGGTTTTTT 

5' 11 21 31 41 

_ _ _ - AAAAGATGAT TTTATATGTATGACATAAAAAAAATCTTTG TAAAGTGCGC 
33 01 ■pTTTCTACTAAAATATACATACTGTATTTTTTTTAGAAACATTTCACGCG 

5' 61 71 81 91 

_ _ ^ _ AAGTGCAATA ATTTAAAGAG GTCTTATCTT TGCATTTATAAATTATAAAT 
33 51 TTCAC GTTATTAAATTTCTC CAGAATAGAA ACGTAAATAT TTAATATTTA 

5' 11 21 31 41 

, A n - ATTGTACATG TGTGTAATTT TTCATGTATT CATTTGCAGT CTTTGTATTT 
3401 TAACATGTAC ACACATTAAA AAGTACATAA GTAAACGTCA GAAACATAAA 

a 5 1 61 71 81 91 

: i AAAAAAACTT TACTGTTATG TTTGTATAAT AGAACATTAATCATTTATTA 

3451 TTTTTTTGAAATGACAATACAAACATATTATCTTGTAATTAGTAAATAAT 

I 5' 11 21 31 41 

4 TAACTCAGACAAGGTGTAAATAAATTCATAATTCAAACAG CCAGTATATA 
3501 ATTG A GTCTG TTCCACATTT ATTTAAGTAT TAAGTTTGTC GGTCATATAT 

5 ' 61 71 81 91 

t TGCATATATG GG TGTTACAT TGCAAAAATC TCTATCTTTGTTCTATTCAC 
I 5 5 1 ACGTATATAC CCACAATGTAACGTTTTTAG AGATAGAAAC AAGATAAGTG 

5 • 11 21 31 41 

^ ATGCTTAAAGAAGTAAGAAATCTTTTGTGGATATGTAATTATACATATAA 
3=6 0 1 -pACGAATTTC TTCATTCTTT AGAAAACACC TATACATTAATATGTATATT 

5' 61 71 81 91 

AGTATATATATATGTATGATACATGAAATATATTTAGAAATGTTCATAAT 
3651 TCATATATAT ATACATACTA TGTACTTTAT ATAAATCTTT ACAAGTATTA 

5' 11 21 31 41 

_ „ n _ TTTAATGGAT ATTCTTTGGT GTGAATAATT GAATACAACATTTTTAAAAT 
AAATTACCTA TAAGAAACCA CACTTATTAA CTTATGTTGT AAAAATTTTA 

5' 61 71 81 91 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAATTTTTT TTTTTTTTTT 
3 751 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTAAAAAA AAAAAAAAAA 

5' 11 21 31 41 

ft n TTATTCCAGAGATTAAAGAC ACTAGATCTT TAACCTTGAA GGGCAGGCAA 
J AATAAGGTCT CTAATTTCTG TGATCTAGAA ATTGGAACTT CCCGTCCGTT 

5' 61 71 81 91 

- q-- GAGGTCGG CAATGCTGTCAA CATAGAAGTC AGGGACCATT TTCTTCTTGA 
CTCCAGCCGT TACGACAGTT GTATCTTCAG TCCCTGGTAA AAGAAGAACT 
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3901 



11 21 31 41 

ACATGCAGTCACTTTCCTGATTGCTCTTCACATCCTCAAGGCTCCGGAA.T 
TGTACGTCAG TGAAAGGACT AACGAGAAGT GTAGGAGTTC CGAGGCCTTA 



5' 61 71 81 91 

Q TCCGGGGGTG TGGTGGGCTT TGATCTCAGG ACTCTGGAGG CAGAAGCAGG 
AGGCCCCCACACCACCCGAAACTAGAGTCCTGAGACCTCCGTCTTCGTCC 

5" 11 21 31 41 

CAGATCTCTG TGAATATGAG GCCAGCCTGC ACTACACAGA GCTCCAGACC 
4 U U X qtCTAGAGACACTTATACTC CGGTCGGACG TGATGTGTCT CGAGGTCTGG 

5' 61 71 81 91 

. _ j- _ AGTCATGGCT ACATCATGAAACCCTGTCTCAAAAAGAAAATAAAAACTGT 
4 U b 1 TCAGTACCGATGTAGTACTTTGGGACAGAGTTTTTCTTTTATTTTTGACA 

0 5 ' 11 21 31 41 

^ TGTGTTTCTA CCATAGTGTT AAACTCAGAG TCTGAGTAAT GTCGGGCTGA 
AC^C^U\AGATGGTAT(^CAATTTGAGTCTCAGACTCATTAC^GCCCGACT 



HU 5 ' 61 71 81 91 

^ CATGCTCGGG TGTTTAACATACCTTCAGCT TTGACGAGGC GCTGAACAGT 
^ GTACGAGCCCACAAATTGTATGGAAGTCGAAACTGCTCCGCGACTTGTCA 

'f 5' 11 21 31 41 

^ CAAAGTCTGG CCTTGGGGAG CGGTGGCTGT GTTTGTGCTC AAGTCCACCG 

3 GTTTCAGACC GGAACCCCTC GCCACCGACA CAAACACGAG TTCAGGTGGC 

□ 5' 61 71 81 91 

^ TGAAATCCTGATTGTGAATTTGGACAACCG TGTCCTTCTT CTTGGCCTTC 
ACTTTAGGAC TAACACTTAAACCTGTTGGC ACAGGAAGAAGAACCGGAAG 

5' 11 21 31 41 

^ n CATGCAACCTCCAACTTCATGTTGGTCATTTTGTCAAAACACTGTGTGAT 
GTACGTTGGA GGTTGAAGTA CAACCAGTAAAACAGTTTTG TGACACACTA 

5' 61 71 81 91 

- -j j-_ GTTTTTATCAATATACTGCC ATTCCACATATGTAGAGATG TAGTCTGCCT 

4 "* 13 - 1 CAAAAATAGT TATATGACGG TAAGGTGTAT ACATCTCTAC ATCAGACGGA 

5' 11 21 31 41 

.. n GGCTTTCCTT TTCTTTAGCC AATCGAATGC TCTTGATCAT GCCCTCAATC 
44 UJ - CCGAAAGGAAAAGAAATCGGTTAGCTTACGAGAACTAGTACGGGAGTTAG 

5' 61 71 81 91 

. ^ TCATCTCTAG CTTTTATCAC GTCTCTGCTAATTCCTGAAA CTTGAATCGA 
4 4 15 X AGTAGAGATC GAAAATAGTG CAGAGACGAT TAAGGACTTT GAACTTAGCT 

5 1 11 21 31 41 

. t- n , AGTTTTCTTC TGGTTCATCT CAATGGTGAT GTTCAGTTCC TTCTGAATCT 
4 b U X TCAAAAGAAG ACCAAGTAGAGTTACCACTA CAAGTCAAGG AAGACTTAGA 
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5* 61 71 81 91 

" 4 551 CATTCAGTTT CTCGTACTCC TCCATGTCAAAGTCACTGAC ACACTCATCG 
GTAAGTCAAA GAGCATGAGG AGGTACAGTT TCAGTGACTG TGTGAGTAGC 

5' 11 21 31 41 

TCATTGGTGT AGGAAAGCTG CTCTTTGGTAATCAGTTCCT TTAGCCAGGA 
AGTAACCACATCCTTTCGACGAGAAACCATTAGTCAAGGAAATCGGTCCT 

5' 61 71 81 91 

4 - 5 - GATTGTTTTG TTCACACTGT CTACCCCTGA ACCACATACC TGGAAAACTG 
CTAACAAAACAAGTGTGACAGATGGGGACTTGGTGTATGGACCTTTTGAC 

5' 11 21 31 41 

? 0 TGTGCTCTAT TTTCTTTTCC AAAACCAGGG TGTTCTTTTT GGGGGAAGCT 
3 ACACGAGATAAAAGAAAAGGTTTTGGTCCCACAAGAAAAACCCCCTTCGA 

5 5' 61 71 81 91 

^ 4751 TGCTTGGGAAAGCCAAGAAAGGCTAAAGAGAAAATGGAAATTAATGTTTC 
3 ACGAACCCTT TCGGTTCTTT CCGATTTCTC TTTTACCTTT AATTACAAAG 

I 5' 11 21 31 41 

^ 4801 TTTTACTCCC TTCAACATCAAGGTTAGGAA TATGTATTTC ATAAAAGCTA 

; " AAAATGAGGG AAGTTGTAGT TCCAATCCTT ATACATAAAG TATTTTCGAT 

Z 5' 61 71 81 91 

ACAACTCACAGGCAATCTTAGACATCACTGACTGCTTGGCAGGCGACTGC 
| TGTTGAGTGT CCGTTAGAAT CTGTAGTGAC TGACGAACCG TCCGCTGACG 

i 5' 11 21 31 41 

== 4901 TTGGGGGGAG CTGGAGAGCC TTCTCTTTCT TTCATGTTGT CGTAAAAAAA 
AACCCCCCTCGACCTCTCGGAAGAGAAAGAAAGTACAACAGCATTTTTTT 

5' 61 71 81 91 

4 g5 - TTGCAGAATA TGGGGCTGGAAGATAACAAC TTTAACTCTC TTCACAGCCT 
AACGTCTTATACCCCGACCTTCTATTGTTGAAATTGAGAGAAGTGTCGGA 

5' 11 21 31 41 

5 0 0 - GCACTGATTT TTTCTGGACAAATTCTTCAATGGCATCTAT TATCGCTTTT 
CGTGACTAAAAAAGACCTGTTTAAGAAGTTACCGTAGATAATAGCGAAAA 

5' 61 71 81 91 

GCTACTACGT TTGGGTCCTGTTGAGCATTT CCTTCAAAAACAAAAAAAGC 
CGATGATGCAAACCCAGGACAACTCGTAAAGGAAGTTTTTGTTTTTTTCG 

5" 11 21 31 41 

5 - 0 - ACATTTTTAAAAAGTCAAGGTTAAGATCCACCTGCAAAAAAAAGCTGCAA 
TGTAAAAATTTTTCAGTTCCAATTCTAGGTGGACGTTTTTTTTCGACGTT 

5' 61 71 81 91 

5 1 5 1 TAT ^ GCGAGG -^ TTCT -AGTTGTCACAGGAAATAAAAATGTCTGTTCCCA 
ATATTCGCTC CTTAAGATCAACAGTGTCCT TTATTTTTAC AGACAAGGGT 
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5' 11 21 31 41 

CTATAATCAATGTAGACTGATAATATTATG CCAGCAAATA GTTTTGAAGT 
. 52 01 GATA TTAGTT ACATCTGACT ATTATAATAC GGTCGTTTAT CAAAACTTCA 

5« 61 71 81 91 

CCTAGGCACAGTGGGAGGAGGTTTTGTTCCACGCTGTTCATAAGCCAATA 
5251 ggatccGTGT CACCCTCCTC CAAAACAAGG TGCGACAAGT ATTCGGTTAT 

5- 11 21 31 41 

CCCCAGCAAAAGACCTTAAAGGACAACTTG TAATTTGGGA CATTCACATC 
5301 GGGGTCGTTT TCTGGAATTT CCTGTTGAAC ATTAAACCCT GTAAGTGTAG 

5> 61 71 81 91 

TGTCCTCTTCATCTGATCTGGCTCCCAGTGTCACTCTCTAACACGGTCCT 
i 3 5351 ACAGGAGAAG TAGACTAGAC CGAGGGTCAC AGTGAGAGAT TGTGCCAGGA 

M 5 1 11 21 31 41 

i 5o TAGAGGGACAATTTATCCCT GCCTCTGCTT GATCTTATGC ATGTATCTGT 

□ 5401 ATCTCCCTGTTAAATAGGGACGGAGACGAACTAGAATACGTACATAGACA 

|=5 5' 61 71 81 91 

S ATTCTTCCAG CCATCCCTGG CGACCTGATT TTTCTAAGGC ACCCAAAACT 

- 5451 TAAGAAGGTC GGTAGGGACC GCTGGACTAA AAAGATTCCG TGGGTTTTGA 

■=i 5 » 11 21 31 41 

^ GTAAGCTACT TCTTATAATC TATAATTCTG AGCATATTAG TTAGCCTGAG 

.= 5501 QATTCGATGAAGAATATTAGATATTAAGAC TCGTATAATC AATCGGACTC 

*; 5 • 61 71 81 91 

CCTCCAGGAT ATCTTTCTTC CCTATACTCAGTCCAGTTTTAGCTGCCCAG 
5551 GGAGGTCCTA TAGAAAGAAG GGATATGAGT CAGGTGAAAATCGACGGGTC 

5' 11 21 31 41 

AAGGATTCAAAGCTGATCTACGAGTAGATCACTCCTGTCTACAGCTTGTT 
5601 <p>pCCTAAGTT TCGACTAGAT GCTCATCTAG TGAGGACAGATGTCGAACAA 

5' 61 71 81 91 

_ CCAGATCTTG TTTCTGAAGC CCTGGAAGCC ATCAGCCAGG TAAGATTGTA 
GGTCTAGAACAAAGAGTTCGGGACCTTCGGTAGTCGGTCCATTCTAACAT 

5' 11 21 31 41 

_ AAACAATCCC TTTCTAATCATGGGTGTGGC CCAAAGTGAA TGGCCGGAAT 
5 701 'pTTGTTAGGGAAAGATTAGTACCCACACCGGGTTTCACTTACCGGCCTTA 

5 ' 61 71 81 91 

TG 

5751 
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Figure 15 



PDElOa and RACEs compiled 

1 CGCCCGGGCA GGTCTGTTGG AGGGCAGTTG GTCAACCTGA CCAGAGAGAG CTGAGCTGGA 
GCGGG CCCGT _CCAGACAACC TCCCGTCAAC CAGTTGGACT GGTCTCTCTC GACTCGACCT 
61 AGACCCCACT GATGGTGTGC TGCCTTTCAG TCCAGGAAGA AAGAAAGGAA GGATTCTGAG 
_ TCTGGGGTGA CTACCACACG ACGGAAAGT C AGG TCCTTC T TTCTTTCCTT CCTAAGACTC 
121 GATTTGGGCA AAGCCACATT CCTGGAGAAG TCTGTATACT GATGCCAAAC CCAAGAGCTG 

CTAAACCCGT TTCGGTGTAA GGACCTCTTC AG AC AT A T G A CTACGGTTTG GGTTCTCGAC 

181 AGCTGCTGAT GAGGCCCAGG GAGTAGCCCA CGCGCCCTGA GCTGTTGGCT AGCAAGGCCT 
T CGACGACTA CTCCGGGTCC CTCATCGGGT GC GCGG GACT CGACAACCGA TCGTTCCGGA 
241 TCCTGCTCCA TGTGGCATGG AAAAATTATA TGGTTTGACG GATGAAAAGG TGAAGGCCTa" 
A GGACG AGGT ACACCGTACC TTTTTAATAT ACCAAACTGC CTACTTTTCC ACTTCCGGAT 
301 TCTTTCTCTC CATCCCCAGG TATTAGATGA ATTTGTTTCT GAAAGTGTTA GTGCAGAGAC 
A GAAAGAGAG GTAGGGGTCC ATAA TCTACT TAAACAAAGA CTTTCACAAT CACGTCTCTG 
361 TGTGGAAAAG TGGCTGAAGA GGAAAACCAA CAAAGCAAAA GATGAACCAT CTCCCAAGGA 

ACACCTTTTC ACCGACTTCT C C TTTTGG TT G TTTCGTTTT CTACTTGGTA GAGGGTTCCT 

4 21 AGTCAGCAGG TACCAGGATA CGAATATGCA GGGAGTCGTG TACGAGCTGA~ACAGCTACAT 

TCAGTCGTCC ATGGTCCTAT GCTTATACGT CCCTCAGCAC AT GC T CG AC T TGTCGATGTA 

461 AGAGCAGCGC CTGGACACGG GCGGGGACAA CCACCT<3CTC CTCTATGAGC TCAGCAGCAT 

TCTCGTCGCG GACCTG TGCC CGCCCCTGTT GGTGGACGAG GAGATACTCG A GTCGTCGTA 

541 CATCAGGATA GCCACAAAAG CCGACGGATT TGCACTGTAC TTCCTTGGAG AGTGCAATAA 
G TAGTC C TAT CGGTGTTTTC GGCTGCCTAA ACG TGACATG AAGGAACCTC TCACGTTATT 
601 TAGCCTGTGT GTGTTCATAC CACCCGGGAT GAAGGAAGGC CAACCCCGGC TCATCCCTGC 
ATCGGACACA CACAAGTATG GTGGGCCCTA CTTCCTTCCG GTTGGGGCCG AGTAGGGACG 
661 AGGGCCCATC ACCCAGGGTA CCACCATCTC TGCCTACGTG GCCAAGTCTA GGAAGACGTT 
TCCCGGGTAG TGGGT CCCAT GGTGGTAGAG ACGGATGCAC CGGT TCAG AT CCTTCTGCAA 
" " EcoRV" Xhoi 

721 GTTGGTAGAG GATATCCTTG GGGATGAGCG ATTTCCTCGA GGTACTGGCC TGGAATCAGG 
CAACCATCTC CTATAGGAAC CCCTACTCGC TAAAGGAGCT CCATGACCGG ACCTTAGTCC 
781 AACCCGCATC CAGTCTGTTC TTTGCTTGCC CATTGTCACT GCCATTGGAG ACTTGATTGG 
TTGGGCGTAG GTCAGACAAG AAACGAACGG GTAACAGTGA CGGTAACCTC TGAACTAACC 
841 CATCCTTGAA CTGTACAGGC ACTGGGGCAA AGAGGCCTTC TGCCTCAGCC ATCAGGAGGT 
GTAGGAACTT GAC ATGTC GG _TGACCCCGTT_ TCJ'CCGGAAG ACGGAGTCGG TAGTCCTCCA 
901 TGCAACAGCC AATCTTGCTT GGGCTTCCGT AGCAATACAC CAGGTGCAGG TGTGTAGAGG 
ACGTTGTCGG TTAGAACGAA CCCGAAGGCA TCGTTATGTG GTCCACGTCC ACACATCTCC 
961 TCTCGCCAAA CAGACCGAAC TGAATGACTT CCTACTCGAC GTATCAAAGA CATACTTTGA 
AGAGCGGTTT GTCTGGCTTG ACTTACTGAA GGATGAGCTG CATAGTTTCT GTATGAAACT 
1021 TAACATAGTT GCCATAGACT CTCTACTTGA ACACATCATG ATATATGCAA AAAATCTAGT 
ATTGTATCAA CGGTATCTGA GAGATGAACT TGTGTAGTACJTATATACGTT TTTT AGATC A 
1081 GAACGCCGAC CGCTGCGCGC TCTTCCAGGT GGACCACAAG AACAAGGAGC TGTACTCGGA 
CTTGCGGCTG GCGACGCGCG AGAAGGTCCA CCTGGTGTTC TTGTTCCTCG ACATGAGCCT 
1141 CCTGTTTGAC ATTGGGGAGG AGAAGGAGGG GAAGCCCATC TTCAAGAAGA CCAAGGAGAT 
GGACAAACTG TAACCCCTCC TCTTCCTCCC CTTCGGGTAG AAGTTCTTCT GGTTCCTCTA 
1201 CAGATTTTCC ATTGAGAAAG GGATTGCTGG TCAAGTGGCA AGAACAGGCG AAGTCTTGAA 
GTCTAAAAGG TAACTCTTTC CCTAACGACC AGTTCACCGT TCTTGTCCGC TTCAGAACTT 
12 61 CATTCCCGAT GCCTACGCGG ACCCTCGCTT TAACAGGGAG GTGGACCTGT ACACAGGCTA 
GTAAGGGCTA CGGATGCGCC TGGGAGCGAA ATTGTCCCTC CACCTGGACA TGTGTCCGAT 
1321 CACCACGAGG AACATTCTGT GTATGCCCAT AGTGAGCCGA GGCAGCGTGA TTGGCGTGGT 
GTGGTGCTCC TTGTAAGACA CATACGGGTA TCACTCGGCT CCGTCGCACT AACCGCACCA 
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PDElOa and RACEs compiled 

1381 GCAGATGGTG AACAAGATCA GCGGTAGCGC CTTCTCCAAG ACAGACGAGA ACAACTTCAA 
CGTCTACCAC TTGTTCTAGT CGCCATCGCG GAAGAGGTTC TGTCTGCTCT TGTTGAAGTT 

BamHi 

14 41 GATGTTTGCT GTCTTCTGCG CACTGGCCTT GCACTGTGCT AACATGTACC ACAGGATCCG 
CTACAAACGA C AGAA GACGC GTGACCGGAA CGTGACACGA TTGTACATGG TGTCCTAGGC 

" ' Hindllf " 

1501 CCACTCAGAA TGCATCTACA GGGTTACCAT GGAGAAGCTT TCCTACCACA GCATCTGCAC 

GGTGAGTCTT AC G TAG AT G T_ CCCAATGGTA CCTCTTCGAA AGGATGGTGT CGTAGACGTG 
1561 CTCCGAGGAG TGGCAAGGCC TCATGCGCTT CAACCTACCA GCACGCATCT GCCGGGACAT 

GAGGCTCCTC ACCGTTCCGG AGTACGCGAA GTTGGATGGT CGTGCGTAGA CGGCCCTGTA 
1621 CGAGCTATTC CACTTTGACA TTGGTCCTTT C G AG AAC AT G TGGCCTGGGA TCTTTGTCTA 

GC TCGATAAG GTGAAACTGT AACCAGGAAA GCTCTTGTAC ACCGGACCCT AGAAACAGAT 
1681 CATGATCCAT CGGTCTTGTG GGACATCCTG TTTTGAACTT GAAAAATTGT GCCGTTTTAT 

GTA CTAGGTA GCCAGAACAC CC TG TAGGAC A AAACTTG AA C TTTTTAACA CGGCAAAATA 
1741 CATGTCTGTG AAGAAGAACT ATCGGCGGGT TCCTTACCAC AACTGGAAGC ATGCAGTCAC 

GTACAGACAC TTCTTCTTGA_ TAG CCGCCCA AGGAA TGGTG TTGACCTTCG TACGTCAGTG 

Xhoi 

1801 GGTGGCACAC TGCATGTATG CCATACTTCA AAACAACAAT GGCCTCTTCA CAGACCTCGA 

CCACCGTGTG ACGTACATAC GGTATGAAGT TTTGTTGTTA CCGGAGAAGT GTCTGGAGCT 

" ~ XhoF 
w 

18 61 GCGCAAAGGC CTGCTAATTG CGTGTCTGTG CCATGACCTG GACCACAGGG GCTTCAGTAA 

CGCGTTTCCG GACGATTAAC GCACAGACAC GGTACTGGAC CTGGTGTCCC CGAAGTCATT 
1921 CAGCTACCTG CAGAAGTTCG ACCACCCCCT GGCGGCGCTG TACTCCACCT CCACCATGGA 

GTCGATGGAC GTCTTCAAGC TGGTGGGGGA CCGCCGCGAC ATGA GG TGGA GGTGGTACCT 
1981 GCAACACCAC TTCTCCCAGA CGGTGTCCAT CCTTCAGCTG GAAGGGCACA ATATCTTCTC 

CGTTGTGGTG AAGAGGGTCT GCCACAGGTA GGAAGTCGAC CTTCCCGTGT TATAGAAGAG 
2041 CACCCTGAGC TCCAGCGAGT ACGAGCAGGT GCTGGAGATC ATCCGCAAAG CCATCATCGC 

GTGGGACTCG AGGTCGCTCA TGCTCGTCCA CGACCTCTAG TAGGCGTTTC GGTAGTAGCG 
2101 CACCGACCTC GCCCTATACT TTGGGAACAG GAAGCAGTTG GAGGAGATGT ACCAGACAGG 

GTGGCTGGAG CGGGATATGA AACCCTTGTC CTTCGTCAAC CTCCTCTACA TGGTCTGTCC 
2161 GTCGCTGAAC CTCCACAACC AG TCCCATCG AGACCGTGTC ATCGGCTTGA TGATGACTGC 

CAGCGACTTG GAGGTGTTGG TCAGGGTAGC TCTGGCACAG TAGCCGAACT AC TACTGACG 
2221 CTGTGATCTT TGCTCTGTGA CCAAACTATG GCCAGTTACA AAATTGACAG CGAATGATAT 

GACACTAGAA ACGAGACACT GGTTTGATAC CGGTCAATGT TTTAACTGTC GCTTACTATA 
EcoRI " 

2281 ATATGCAGAA TTCTGGGCTG AGGGTGATGA GATGAAGAAG CTGGGCATAC AGCCCATTCC 

TATACGTCTT AAGACCCGAC TCCCACTACT CTACTTCTTC GACCCGTATG TCGGGTAAGG 
2341 TATGATGGAC AGAGACAAGC GAGATGAAGT CCCTCAAGGG CAGCTCGGAT TCTACAATGC 

ATACTACCTG TCTCTGTTCG CTCTACTTCA GGGAGTTCCC GTCGAGCCTA AGAT GTTA CG 
2401 TGTGGCCATT CCCTGCTATA CCACCTTGAC GCAGATCCTC CCACCCACAG AGCCTCTGCT 

ACACCGGTAA GGGACGATAT GGTGGAACTG CGTCTAGGAG GGTGGGTGTC TC GGAGACGA 
24 61 GAAGGCCTGC AGGGATAACC TCAATCAGTG GGAGAAGGTA ATTCGCGGGG AAGAGACAGC 

CTTCCGGACG TCCCTATTGG AGTTAGTCAC CCTCTTCCAT TAAGC GCCCC TTCTCTGTCG 
2 521 AATGTGGATT TCAGGCCCAG GCCCGGCGCC TAGCAAGAGC ACACCTGAGA AGCTGAACGT 

TTACACCTAA AGTCCGGGTC CGGGCCGCGG _ATCGTTCTCG_ TGTGGACTCT T CGACTTGCA 
2581 GAAGGTTGAA GACTGATCCT GAAGTGACGT CCTGATGTCT GCCCAGCAAC CGACTCAACC 

CTTCCAACTT CTGACTAGGA CTTCACTGCA GGACTACAGA CGGGTCGTTG G CTGAGTTGG 
264 1 TGCTTCTGTG ACTTCGTTCT TTTTGTTTTC AAGGGGTGAA AACCCCCTGT CAGAAGGTAC 

ACGAAGACAC TGAA GCAAGA AAAACAAAAG TTCCCCACTT TTGGGGGACA G TCTTCCATG 
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PDElOa and RACES compiled 

27 01 CGTCGCATAT CCATGTGAAG CAGACGACTC CCTGCTTGCC GCACACACCT CGGACAGTGA 
G CAGC GTATA GGTACACT TC GTCTGCTGAG GGACGAACGG CGTGTGTGGA GCCTGTCACT 

27 61 GCAACCCAGG CTCTGCCGTG TTCAGACGTC GGCTACTCCG TGGCTCCACC TGACCTCCGA 

CGTTGGGTCC GAGAC GGCAC AAGTCTGCAG CCGATGAGGC ACCGAGGTGG_ ACTGGAGGCT 
2821 ATGCTATTTG CTCCCAGGCC AGCACTGCAC TGTCTGGAGG GGGCAGAGAC CACA^GAGAG~ 
TAC GATAAAC GAGGGTCCGG TCGTGACGTG ACAGACCTCC CCCGTCTCTG GTGTCCTCTC 

28 81 GTTCTTGCCT GCATCCTCCC ATGAGGGTGT GGCCAGTTCC CTAGTTCTGT GCCATGCTGC 
C AAGAACGGA CGTAGGAGGG TA CTCC CACA CCGGTCAAGG GATCAAGACA CGGTACGACG 
TGCTTGGTGG CATTGGTTAG GAATGGGACA CACGCCCCTT GTTGTGAAGT TTACATGTGA 
AC GAACCACC GTAACCAAT C CTTACCCTGT GTGCGGGGAA CAACACTTCA AATGTACACT 
CCTTCTTATA GGTTAACTGA GTTTGTGGCC TGGACACATG TAATGAAGGT CACAGTCCAC 
GGAAGAATAT CCAATTGACT CAAACA CCGG ACCTGT GTA C ATTACTTCCA GTGTCAGGTG 
AGGTGACAGA GAAATCCAAA CTGTTGATTA CAGGTGCACT ACAGGTATGC TCTTTCAGTC 

_ T CCACTGTCT CTTTAGGTTT GACAACTAAT GTCCACGTGA TGTCCATACG AGAAAGTC AG 
3121 TATCTGGGGG CACATAGGTG AGTCTGCTCC ACTCAGAANN AAGCATACCT CTGCCCTCAT~ 

ATAGACCCCC GTGTATCCAC TCAGACGAGG TGAGTCTTNN TTCGTATGGA G ACGGGAGTA 

3181 CCAGGGGACA CAGGGTACAT CCCAGGCATC GGGGAACTGA AGCTCTCACT TCAAACCATG 
GGTC CCCTGT GTCCCATGTA GGGTCCGTAG CCCCTTGACT TGGAGAGTGA AGTTTGGTAC 
3241 TCAAAGAATT AAAACACCTC CCCTCCCCCT CACTGTAGCC TTCGACAACT GCGCCAATCC 
AGTTTCTTAA TTTTGTGGAG GGGAGGGGGA GTGACATCGG AAGCTGTTGA CGCGGTTAGG 
3301 CTTTATACAA AGAAAATAAA AGTAAGGCAT ATAAATTTCC TCCAGCAAGC AAATCTTGTG 
GAAATATG TT TC T TTTATT T TCATTCCGTA TATTTAAAGG AGGTCGTTCG TTTAGAACAC 
3361 GGTAAAAAAA AAGCATGTGA ATNNTAACAA CNTCTANANT NTCNCNGNAT GTTATGGCAG 
CCATTTTTTT TTCGTACAC T TANNATTGTT GNAGATNTNA NAGNGNCNTA CAATACCGTC 
34 21 AATTTTAGTC ACGTCCAAAA CAAAAAGATT ATTCCAGAAG ATACCTCATC CTATGCCTGA 
TTAAAATCAG T GCAGGTTT T GTTTTTCTAA TAAGGTCTTC TATGGAGTAG GATACGGACT 
34 81 AAGGCTCCAC AGCATGGCGT CCGTCTCCCA GGGTTCTGAT CCGTCTCCTC" ACGGTGCAAT 
TTCCGAGGTG TCGTACCGCA GGCAGAGGGT CCCAAGACTA G GCA GAGGAG TGCCACGTTA 
3541 CAGGCAGGAC AGAGAGGAGG GCTGCAGGGC TACCACATTG ACCCAGAAGG TATCTCCTCT 
GTCCGTCCTG TCTCTCCTCC CGACGTCCCG ATGGTGTAAC TGGGTCTTCC ATAGAGGAGA 
3601 CACCAT TCAG ACATCCATAA GGAATGCCAA ATGCTGTATT GAATAGTTCT CTGTGTGACT 
GTGGTAAGTC TGTAGGTATT CCTTACGGTT TACGACATAA CTTATCAAGA GACACACTGA 
Xbaf 

3661 TTCTAGAGAA GCCAGGACAC CCTGAGCCTT TCCNGGGGAA CTCTAAGGAG TCACAGGTTC 

AAGATCTCTT CGGTCCTGTG GGACTCGGAA AGGNCCCCTT GAGATTCCTC AGTGTCCAAG 
3721 ACACCGTGGG GATTTTCAGG ATAGCATGGA GACAGAGATC CGGTCGTTGT TCTCACTCGT 

TGTGGCACCC CTAAAAGTCC TATCGTACCT CTGTCTCTAG GCCAGCAACA AGAGTGAGCA 
37 81 GAGCCTTGAG AAGGAGAGAC TGACCAGAAA CACTCACTCA GCACTCTGCA GGAGCAGGAG 

CTCGGAACTC TTCCTCTCTG ACTGGTCTTT GTGAGTGAGT CGTGAGACGT CCTCGT CCTC 
3841 AAGATACTTT AAGATGAATC TTGGATAGAT TTTGATACAC CCAATACCAT ACACACAGGA 

TTCTATGAAA TTCTACTTAG AACCTATCTA AAACTATGTG GGTTATGGTA TGTGTGTCCT 
3901 GCTTGGCATT TGCAAAGTCT ATTCAGTTTC CTTCCGCGCT CTGACCCACG GTTGTAGCGG 

CGAACCGTAA ACGTT TCAGA TAAGTC AAA G GAAGGCGCGA GACTGGGTGC CA ACATCGC C 
3961 AGTGGGCTGA ACACTGTAAC ACTGTACATG CGATTTCCCC ATGGGCTTCT AAAATGTCAC 

TCACCCGACT TGTGACATTG TGACATGTAC GCTAAAGGGG TACCCGAAGA TTTTACAGTG 

4 021 CATCTCCTCC CCTGCTGTGT CCTACTCCAT TTACTGGTTA CAAGGTGATG TCAACAAGAG 
GTAGAGGAGG GGACGACACA GGATGAGGTA AATGACCAAT GTTCCACTAC AGTTGTTCTC 



2941 
3001 
3061 
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Figure 1 5 (con't) 



PDElOa and RACES compiled 

4 081 AAGCTATCAC AACACCAGGG CTGTGCACAC GTGCACACAC ATGTATGCAC AAGCACACAG 

TTCGATAGTG TTGTGGTCCC GACACGTGTG CACGTGTGTG TACATACGTG TTCGTGTGTC 
4 14 1 ATGTATGTAC AGCACACACA CACACACACA CCCCAAAAGG AGAGAAAAGG AAGAAAACAT 

TACATACATG TCGTGTGTGT GTGTGTGTGT GGGGTTTTCC TCTCTTTTCC TTCTTTTGTA 
4 201 TTATAAAAAG CGACAGCTAC CCCATATCAA AATAGTCTTT CCTGTAGGAA ACAGGAGCTC 

AATATTTTTC GCTGTCGATG GGGTATAGTT TTATCAGAAA GGACATCCTT TGTCCTCGAG 
42 61 TCCATAAGGA ATTATCATGA GTGTGTTCTC CCATCAGTGC ACTCTCCCAG GGGTGCTCAC 

AGGTATTCCT TAATAGTACT CACACAAGAG GGTAGTCACG TGAGAGGGTC CCCACGAGTG 
4 321 TGAAGCTGGT CCACRTCTAT AAACAGGTGA CACTGGCTGC AGCAAAAAGC CAT TCGATCC 

ACTTCGACCA GGTGRAGATA TTT GTCCACT GTGACCGACG TCGTTTTTCG GTAAGCTAGG 
4 381 ACACAAATTG ATCTTCTATC ATCTTGGAAT CTGAATTGCA GGGAGGAGCA GYATGTAAGA 

TGTGTTTAAC TAGAAGATAG TAGAACCTTA GACTTAACGT CCCTCCTCGT CYTACATTCT 

4 441 CGACCGTTTA ATTCAGGCAT TCCGAAGGCA TGAGCGCATG GATTCTRTCA CCAAGCGTAT 

GCTGGCAAAT TAAGTCCG TA AGGCTTCCGT ACTCGCGTAC CTAAGARAGT GGTTCGCATA 
4501 AAAAGGACCC TGGCATTGGG AAACCTATGA CGGACTGTTT TTGCTGTAGA AGTAGGGATT 

TTT TCCTGGG ACCGTAACCC TTTGGATACT GCCTGAC AAA AACG ACATCT TCATCCCTAA 
4561 TTACAGAAGT CTCCTTGRAT TTGCCCTGCC TGGGGCAGTT TTGCAGAGGA ACCTGCCAGA 

AATGTCTTCA GAGGAACRTA AACGGGACGG ACCCCGTCAA AACGTCTCC T TGGACGGTCT 
4 621 GATTTATTGG CTGGTCAGTC TCTTGTGAAA TAGTATCATG TGAGAAACAG TTTGTAGAAA 

CTAAATAACC GACCAGTCAG AGAACACTTT ATCATAGTAC ACTCTTTGT C AAACATCTTT 
4 681 AAAACTATAC CTGGGAAGAC CTTTGCAACA TTGTTCCTTC CATGGGCCAA GACTCAGTTA 

TTTTGATATG GACCCTTCTG GAAACGTTGT AACAAGGAAG GTACCCGGTT CTGAGTCAAT 
4 741 GGAGGCATAA ATCTGCCCGG AATAAACTAG GCCAGGATAC AGCCATGTTT AGTTAATAAT 

CCTCCGTATT TAGACGGGCC TTATTTGATC CGGTCCTATG TCGGTACAAA TCAATTATTA 

... _ . 

vwnvwvww 

4 801 TTGGTTTTAG AATTCACACA GGCAGGATTG GTTTTTTTGT GTCTTGGCAA GTGGAGCATA 

AACCAAAATC TTAAGTGTGT CCGTCCTAAC CAAAAAAACA CAGAACCGTT CACCTCGTAT 
4 8 61 TTTAACATAC AGGCATGGGA ATCCTGCCTC TTAGCTTTTC CCACCCTCTT GTCT CACCAA 

AAATTGTATG TCCGTACCCT TAGGACGGAG AATCGAAAAG GGTGGGAGAA CAGAGTGGTT 
4 921 GTTTTTTCTC TCCAAAGGTT TCCAGGAATT TCTCATTAAT GGCTGATGCA AACTTAGTGA 

CAAAAAAGAG AGGTTTCCAA AGGTC CTTAA AGAGTAATTA CCGAC TACGT TTGAATCACT 
4 981 ATAATAATGA AXATAAACAA TGCTCACCTC ACCAAAATTA TATTATTTGC AGTCATTTGT 

TATTATTACT TATATTTGTT ACGAGTGGAG TGGTTTTAAT ATAATAAACG TCAGTAAACA 
5041 GATAACACAA ATTTTATCGC AATGGTTATT ATTTAATTTG TGGCCACACA CTGTGGTTAT 

CTATTGTGTT TAAAATAGCG TTACCAATAA TAAATTAAAC ACCGGTGTGT GACACCAATA 
5101 CTTTTGTTGT GGTTGTTTCT GAGAAAATGT TCTTGGATAT GTAAGTGCCA ATACCAGTGT 

GAAAACAACA CCAACAAAGA CTCTTTTACA AGAACCTATA CATTCACGGT TA£GGTCACA_ 
5161 GAAGTATTGA TCCCGGGCAG CAAAATACAG CCTAAGGTTT GTAAACATCA ATTCTATCTC 

CTTCATAACT AGGGCCCGTC GTTTTATGTC GGATTCCAAA CATTTGTAGT TAAGATAGAG 
5221 AGTTCATCAG AGGGCCTGAG AAGCTGCGGG GCAG TGT AAA GTAAAGTATG CTGGGCTGGT 

TCAAGTAGTC TCCCGGACTC TTCGACGCCC CGTCACATTT CATTTCATAC GACC CGAC CA 
5281 GGTGGTCAGC CTCCCCTTGC CAAGAAGAGA GCAATTGAAT CCTGTCCCCA GCTCCCTCCA 

CCACCAGTCG GAGGGGAACG GTTCTTCTCT CGTTAACTTA GGACAGGGGT CGAGGG AGG T 
5341 CGCCTGAAGA GTGACCAGTG CTGGCCCGAC GGATCGCTGA GATATTCTCC CATAATGGCA 

GCGGACTTCT CACTGGTCAC GACCGGGCTG^ CCTAGCGACT CTATAAGAGG G TATTACCGT 
54 01 AAAAAATAGG CAGTTTGATG TGACCTGTTT AGTGTGGCTC TCCTCTTTTG AGCATGTGTT 

TTTTTTATCC GTCAAACTAC ACTGGACAAA TCACACCGAG AGGAGAAAAC TCGTACACAA 
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PDElOa and RACES compiled 

54 61 AGCATTTTTA TTTTATACTC ATCCAGTGAA CTCTGCTCTT CCAAGTGTGT TCATGTATGT 
TCG TAAA AAT AAAATA TGAG TAGGTCACTT GAGACGAGAA GGTTCACACA AGTACATACA 
5521 GCTAGATATA TTAGCACAGC CTGCCTTCTG CTGCACAACG CCTTAGAGAC CCG~GCCTTTC 
CGATCTATAT AATCGTGTCG G AC GGAAGAC GACGTGTTGC GGAATCTCTG GGCCGGAAAG 
5581 AATGAGCTTA GCTTGTGCTC TGTTTCTGCT CTCTTAGGTC TAAACTATGG TGTCAGTTTT 
TTACTCGAAT CGAACACGAG ACA AAGACGA GAGAATCCAG ATTTGATACC ACAGTCAAAA 
5641 AATAGAACAA AAGTATGCAT CTTGCCTTGG CTTGAGCCTT TTCGTTTTCA ATGCTGACTT 
TTATCTTGTT TTCATAC GTA GAACGGAACC GAACTCGGAA AAGCAAAAGT TACGACTGAA 
57 01 CTCCCCTTTC TCTCCTGTGC TCACCTTACC TTTCCAGAGT GTAAGGGACA ACTTTTAAGG 
GAGGGGAAAG AGAGGACACG AGTGGAATGG AAAGGTCTCA CATTCCCTGT TGAAAATTCC 
57 61 AGGCGTGTCC CTGGTAGGGG CATCCCTGTT CACCAGGTGC CTGTCATCAC CCCACTTGAC 

TCCGCACAGG GACCATCCCC GTAGGGACAA GTGGTCCACG GACAGTAGTG GGGTGAACTG 

5821 TGACATCTAC CCTGGTGACT ATGGGTTCCT CTTGTTTGTA GGGAACGGTG GCTCCAGGTG 

ACTGTAGATG GGACCACTGA TAC CCAAGGA GAACAAAC AT CCC TTGCCAC CGAGGTCCAC 

5881 GAGGCATCAA TCTGTTGGGT TCTGGTTCCC GGCTGCCTTT GGTTTTGAAA GTCTCTTCTcT 

CTCCGTAGTT AGACAACCCA AGACCAAGGG CCGACGGAAA CCAAAACTTT CAGAGAAGAG 

5 941 TGTATATTCC TACCCTGCAT TTGCTTTGTG TGGTGCTGAT GCTGTGGCAG TAGGATCTTG 
ACATATAAGG ATGGGACGTA AACGAAACAC ACCACGACTA CGACACCGTC ATCCTAGAAC 
6001 GATGACTCTC CATCAGTCAC AGACTCCCCC TGTTGCAAAG TGTCAGGCTG ACTCGACAGT 
CTACTGAGAG GTAGTCAGTG TCTGAGGGGG ACAACGTTTC ACA GTCCGAC TGAGCTGTCA 
6061 CACCGTAAAA TCTGAGTCAG TCACACACAG GCTGTCAGCC ACGGCTTCCA CTTGCATGGC 
_ _ GTGGCATTTT AGACTCAGTC AG TGTGTGTC CGACAGTCGG TGCCGAAGGT GAACGTACCG 
6121 TATTCTATTT TCACACGTGA GTTTCTGTTG CTGGCTGGCT GACTGGCATT ATCTATGCTA 
ATAAGATAAA AGTG TGCAC T CAAAGACAAC GACCGACCGA CTGACCGTAA TAGATACGAT 
6181 AGT TGAAATC AGGAGTGTGC CCAGCAGAGC CCATCATTCT CACTGTCTTT GAAACAAAGC 
TCAACT TTAG TCCTCACACG GGTCGTCTCG GGTAGTAAGA GTGACAGAAA CTTTGTTTCG 
6241 TGTACGGTTT GATCGATGAA CGTATTTAAA GCATTTCATG C AAT G AC AAA GTGCTCAGTA 
ACATGCCAAA CTAGCTACTT GCATAAATTT CGTAAAGTAC GTTACTGTTT CACGAGTCAT 
6301 GTGGAAGGCA GGCTGTGACC AGTCTGCCTG CTCCTTACTA TAATTGTGAG GATTTGTTAC 
CACCTTCCGT CCGACACTGG TCAGACGGAC GAGGAATGAT ATTAACACTC CTAAACAATG 
6361 TGGAACAGTA CATGGAGGCC TGACCTTGTG GGGGCACAGG GTGGAACCTT AGCTGAATAT 
ACCTTGTCAT GTACCTCCGG ACTGGAACAC CCCCGTGTCC CACCTTGGAA TCGACTTATA 
64 21 AGTGTGTGTC TCAAGAGGAA GTCAGGGTAC TAGCTCAGTG CTCAATCTCC AGGTACTATA 
TCACACACAG AGTTCTCCTT CAGTCCCATG ATCGAGTCAC GAGTTAGAGG TCCATGATAT 
64 81 TATACATTTG CCCGTTTTAT CTCTAATGTG AAATAAATCC CCAAACACTT GTTTATCGTG 
ATATGTAAAC GGGCAAAATA GAGATTACAC TTTATTTAGG GGTTTGTGAA CAAATAGCAC 
6541 TAGCGTACCT AAAAGACTAT TCTATTATGG GTGTCCCCAC TTTCTTGGTT TGGTCACCCC 
ATCGCATGGA TTTTCTGATA AGATAATACC CACAGGGGTG AAAGAACCAA ACCAGTGGGG 

Xbal 

VSVAWAWV 

6601 GATCCCCCGG TCTTCTGCTG TATCTAGAAC AGTGACTATA AATGATGTAT GGGAATAGTG 

CTAGGGGGCC AGAAGACGAC ATAGATCTTG TCACTGATAT TTACTACATA CCCTTATCAC 
6661 TTTCCATATG ATCTGTTGTC TGGAGTATAT GCTACATGTT CATTTACTGT ACAAAAACCC 

AAAGGTATAC TAGACAACAG ACCTCATATA CGATGTACAA GTAAATGACA TGTTTTTGGG 
6721 AGTGCAGCTG ATGATGCAAA GCAGTCTCTC TCTGTGTACA GTGCCCCACC TATTTAAAAA 

TCACGTCGAC TA CTA CGTTT CGTCAGAGAG AGACACATGT CACGGGGTGG ATAAATTTTT 
67 81 TCACGTACAA NCCCAGAACA CTGTGAAACA CTTAACATAA GAAACAAACG CAGCGTCTGG 

AGTGCATGTT NGGGTCTTGT GACACTTTGT GAATTGTATT CTTTGTTTGC GTCGCAGACC 
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PDElOa and RACEs compiled 

6841 ATTCTTTCCA AGGAGAGCAG CTTTCTCCAC AGGAACACAG TAACAAAAGA GGTCCGCCGC 
TAAGAAAGGT TCCTCTCGTC GAAAGAGGTG TCCTTGTGTC ATTGTTTTCT CCAGGCGGCG 

6901 CATCCACACC CAGCCAAGAC ACCTCAGAGG CCATAGGGAC AACCTCCTTG CTGGCCAACA 
G TAGGTGTGG GTCGGTTCTG TGGAGTCTCC GGJJ^TCCCTG TTGGAGGAAC GACCGGTTGT 

6961 CCTGCTGGAG CAGGGCACAG GTCCCAGCAA CTGATCCTCA GTGGATGGGT CCGCAGTCAA 
GGACGACCTC GTCCCGTGTC CAGGGTCGTT GACTAGGAGT CACCTACCCA GGCGTCAGTT 

Hindi EcoRV 

7 021 AGCCTTAATG GGCTCTCTTT TGAAGGGGAA AGAAANNTTT CAAGCTTATG ATATCCAACA 

TCGGAATTAC C CGAGAGAAA ACTTCCCCTT TCTTTNNAAA GTTCGAATAC TATAGGTTGT 
7 081 TTATTATAGT TGATGAGTTA GTAAATTCCG AAAAAAAAAG ATGATTTTAT ATGTATGACA 

AATAATATCA ACTACTCAAT CATTTAAGGC TTTTTTTTTC T AC T AAAAT A TACATACTGT 
7141 TAAAAAAAAT CTTTGTAAAG TGCGCAAGTG CAATAATTTA AAGAGGTCTT ATCTTTGCAT 

ATTTTTT TTA GAAACAT TTC ACGCGTTCAC GTTATTAAAT TTCTCCAGAA TAGAAACGTA 
7201 TTATAAATTA TAAATATTGT ACATGTGTGT AATTTTTCAT GTATTCATTT GCAGTCTTTG 

AATATTTAAT A TTTATA ACA TGTACACACA T TAAAAAG T A CATAAGTAAA CGTCAGAAAC 
72 61 TATTTAAAAA AACTTTACTG TTATGTTTGT ATAATAGAAC ATTAATCATT TATTATAACT 

ATAAATTTTT TTGAAATGAC AATACAAACA TATTATCTTG TAATTAGTAA ATAATATTGA 
7321 CAGACAAGGT GTAAATAAAT TCATAATTCA AACAGCCAGT ATATATGCAT ATATGGGTGT 

GT CTGTTC C A CATTTATTTA AG TAT TAAGT TTGTC G GTC A TATATACGTA TATACCCACA 
7 381 TACATTGCAA AAATCTCTAT CTTTGTTCTA TTCACATGCT TAAAGAAGTA AGAAATCTTT 

ATGTAACGTT TTTAGAGATA GAAACAAGAT AAGTGTACGA ATTTCTTCAT TCTTTAGAAA 

74 41 TGTGGATATG TAATTATACA TATAAAGTAT ATATATATGT ATGATACATG AAATATATTT 

ACACCTATAC ATTAATATGT ATATTTCATA TATATATACA TACTATGTAC TTTATATAAA 
7501 AGAAATGTTC ATAATTTTAA TGGATATTCT TTGGTGTGAA TAATTGAATA CAACATTTTT 
TCTTTACAAG TATTAAAATT ACCTATAAGA AACCACACTT ATTAACTTAT GTTGTAAAAA 

75 61 AAAATGAAAA AAAAAAAAAA C 

TTTTACTTTT TTTTTTTTTT G 
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Figure 19 

PDEl'OA compiled 

1 CGCCCGGGCA GGTCTGTTGG AGGGCAGTTG GTCAACCTGA CCAGAGAGAG CTGAGCTGGA 

GCGGGCCCGT CCAGACAACC TCCCGTCAAC CAGTTGGACT GGTCTCTCTC GACTCGACCT 

61 AGACCCCACT GATGGTGTGC TGCCTTTCAG TCCAGGAAGA AAGAAAGGAA GGATTCTGAG 

TCTGGGGTGA CJTACCACACG ACGGAAAGTC AGGTCCTTCT TTCTTTCCTT CCTAAGACTC 

121 GATTTGGGCA AAGCCACATT CCTGGAGAAG TCTGTATACT GATGCCAAAC CCAAGAGCTG 

CTAAACCCGT TTCGGTGTAA GGACCTCTTC AGACATATGA CTACGGTTTG GGTTCTCGAC 

181 AGCTGCTGAT GAGGCCCAGG GAGTAGCCCA CGCGCCCTGA GCTGTTGGCT AGCAAGGCCT 

TC GACGACTA CTCCGG GTCC CTCATCGGGT GCGCGGGACT CGACAACCGA TCGTTCCGGA 

241 TCCTGCTCCA TGTGGCATGG AAAAATTATA TGGTTTGACG GATGAAAAGG TGAAGGCCTA 

AGGACGAG GT AC ACCGTACC TTTTTAATAT ACCAAAC TGC CTACTTTTCC ACTTCCGGAT 

301 TCTTTCTCTC CATCCCCAGG TATTAGATGA ATTTGTTTCT GAAAGTGTTA GTGCAGAGAC 

A GAAA GAGAG GTAGGGGTCC AT AATCTACT TAAACAAAGA CTTTCACAAT CACGTCTCTG 

3 61 TGTGGAAAAG TGGCTGAAGA GGAAAACCAA CAAAGCAAAA GATGAACCAT CTCCCAAGGA 

ACACCTTTTC ACCG ACTTCT CCTTTTGGTT GTTTCGTTTT CTACTTGGTA GAGGGTTCCT 
421 AGTCAGCAGG TACCAGGATA CGAATATGCA GGGAGTCGTG 
TCAGTCGTCC ATGGTCCTAT GCT TATACGT CCCTCAGCAC 

4 81 AGAGCAGCGC CTGGACACGG GCGGGGACAA CCACCTGCTC 

TCTCG_TCGCG_ GACCTG TGCC CGCCCCTGTT GGTGGACGAG 

541 CATCAGGATA GCCACAAAAG CCGACGGATT TGCACTGTAC 

GTAGTCCTAT CGGTGTTTTC GGCTGCCTAA ACGTGACAT G 
601 TAGCCTGTGT GTGTTCATAC CACCCGGGAT GAAGGAAGGC 

ATCGGACACA CACAAGTATG GTGGGCCCTA CTTCCTTCCG 
661 AGGGCCCATC ACCCAGGGTA CCACCATCTC TGCCTACGTG 

TCCCGGGTAG TGGGTCCCAT GGTGGTAGAG ACGGATGCAC 
7 21 GTTGGTAGAG GATATCCTTG GGGATGAGCG ATTTCCTCGA 

CAACCATCTC CT ATAG GAAC CCCTACTCGC TAAAGGAGCT 
7 81 AACCCGCATC CAGTCTGTTC TTTGCTTGCC CATTGTCACT 

TTGGGCGTAG GTCAGACAAG AAACG AACGG GTAACAGTGA 
841 CATCCTTGAA CTGTACAGGC ACTGGGGCAA AGAGGCCTTC 

GTAGGAACTT GACATGTCCG TGACCCCGTT TCTCCGGAAG 
901 TGCAACAGCC AATCTTGCTT GGGCTTCCGT AGCAATACAC 

ACGTTGTCGG TTAGAACGAA CCCGAAGGCA TCGTTATGTG 



GTATCAAAGA CATACTTTGA 
CATAGTTTCT GTATGAAACT 
ATATATGCAA AAAATCTAGT 
TATATACGTT TTTTAGATCA 
AACAAGGAGC TGTACTCGGA 
TTGTTCCTCG ACATGAGCCT 

TTCAAGAAGA CCAAGGAGAT 
AAGTTCTTCT GGTTCCTCTA 

AGAACAGGCG AAGTCTTGAA 
TCTTGTCCGC TTCAGAACTT 
GTGGACCTGT ACACAGGCTA 
CACCTGGACA TGTGTCCGAT 
GGCAGCGTGA TTGGCGTGGT 
CCGTCGCACT AACCGCACCA 



TACGAGCTGA 
ATGCTCGACT 
CTCTATGAGC 
GAGATACTCG 
TTCCTTGGAG 
AAGGAACCTC 
CAACCCCGGC 
GTTGGGGCCG 
GCCAAGTCTA 
CGGTTCAGAT 
GGTACTGGCC 
CCATGACCGG 
GCCATTGGAG 
CGGTAACCTC 
TGCCTCAGCC 
ACGGAGTCGG 
CAGGTGCAGG 
GTCCACGTCC 



ACAGCTACAT 
TGTCGATGTA 
TCAGCAGCAT 
AGTCGTCGTA 

AGTGCAATAA 
TCACGTTATT 
TCATCCCTGC 
AGTAGGGACG 
GGAAGACGTT 
CCTTCTGCAA 
TGGAATCAGG 
ACCTTAGTCC 
ACTTGATTGG 
TGAACTAACC 
ATCAGGAGGT 
TAGTCCTCCA 
TGTGTAGAGG 
ACACATCTCC 



9 61 TCTCGCCAAA CAGACCGAAC TGAATGACTT CCTACTCGAC 

AGAGCGGTTT GTCTGGCTTG AC T T AC T G AA GGATGAGCTG 
1021 TAACATAGTT GCCATAGACT CTCTACTTGA ACACATCATG 

ATTGTATCAA CGGTATCTGA GAGATGAACT TGTGTAGTAC 
1081 GAACGCCGAC CGCTGCGCGC TCTTCCAGGT GGACCACAAG 

CTTGCGGCTG GCGACGCGCG AGAAGGTCCA CCTGGTGTTC 
1141 CCTGTTTGAC ATTGGGGAGG AGAAGGAGGG GAAGCCCATC 

GGACAAACTG TAACCCCTCC TCTTCCTCCC CTTCGGGTAG 
1201 CAGATTTTCC AT TG AGAAAG GGATTGCTGG TCAAGTGGCA 

GTCTAAAAGG TAACTCTTTC CCTAACGACC AGTTCACCGT 
12 61 CATTCCCGAT GCCTACGCGG ACCCTCGCTT TAACAGGGAG 

GTAAGGGCTA CGGATGCGCC _TGGGAGCGAA ATTGTCCCTC 
1321 CACCACGAGG AACATTCTGT GTATGCCCAT AGTGAGCCGA 

GTGGTGCTCC TTGTAAGACA CATACGGGTA TCACTCGGCT 
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PDE10A compiled 

1381 GCAGATGGTG AACAAGATCA GCGGTAGCGC CTTCTCCAAG ACAGACGAGA ACAACTTCAA 

CGTCTACCAC TTGTTCTAGT CGCCATCGCG GAAGAGGTTC TGTCTGCTCT TGTTGAAGTT 
14 41 GATGTTTGCT GTCTTCTGCG CACTGGCCTT GCACTGTGCT AACATGTACC ACAGGATCCG 

CTACAAACGA CAGAAGACGC GTGACCGGAA CGTGACACGA TTGTACATGG TGTCCTAGGC 
1501 CCACT CAGAA TGCATCTACA GGGTTACCAT GGAGAAGCTT TCCTACCACA GCATCTGCAC 

GGTGAGTCTT ACGTAGATGT CCCAATGGTA CCTCTTCGAA AGGATGGTGT CGTAGACGTG 
1561 CTCCGAGGAG TGGCAAGGCC TCATGCGCTT CAACCTACCA GCACGCATCT GCCGGGACAT 

GAGGCTCCTC ACCGTTCCGG AGT ACGCGAA GTTGGATGGT CGTGCGTAGA CGGCCCTGTA 
1621 CGAGCTATTC CACTTTGACA TTGGTCCTTT CGAGAACATG TGGCCTGGGA TCTTTGTCTA 

GCTCGATAAG GTGAAACTGT AACCAGGAAA GCTCTTGTAC ACCGGACCCT AGAAACAGAT 
1681 CATGATCCAT CGGTCTTGTG GGACATCCTG TTTTGAACTT GAAAAATTGT GCCGTTTTAT 

GTACTAGGTA GCCAGAACAC CCTGTAGGAC AAAACTTGAA CTTTTTAACA CGGCAAAATA 

17 41 CATGTCTGTG AAGAAGAACT ATCGGCGGGT TCCTTACCAC AACTGGAAGC ATGCAGTCAC 

GTACAGACAC TTCTTCTTGA TAGCCGCCCA AGGAATGGTG TTGACCTTCG TACGTCAGTG 
1801 GGTGGCACAC TGCATGTATG CCATACTTCA AAACAACAAT GGCCTCTTCA CAGACCTCGA 
CCACCGTGTG ACGTACATAC GGTATGAAGT TTTGTTGTTA CCGGAGAAGT GTCTGGAGCT 

18 61 GCGCAAAGGC CTGCTAATTG CGTGTCTGTG CCATGACCTG GACCACAGGG GCTTCAGTAA 

CGCGTTTCCG GACGATT AAC GCACAGACAC GGTACTGGAC CTGGTGTCCC CGAAGTCATT 
1921 CAGCTACCTG CAGAAGTTCG ACCACCCCCT GGCGGCGCTG TACTCCACCT CCACCATGGA 

GTCGATGGAC GTCTTCAAGC TGGTGGGGGA CCGCCGCGAC ATGAGGTGGA GGTGGTACCT 
1981 GCAACACCAC TTCTCCCAGA CGGTGTCCAT CCTTCAGCTG GAAGGGCACA ATATCTTCTC 

CGTTGTGGTG AAGAGGGTCT GCCACAGGTA GGAAGTCGAC CTTCCCGTGT TATAGAAGAG 

2041 CACCCTGAGC TCCAGCGAGT ACGAGCAGGT GCTGGAGATC ATCCGCAAAG CCATCATCGC 

GTGGGACTCG AGGTCGCTCA TGCTCGTCCA CGACCTCTAG TAGGCGTTTC GGTAGTAGCG 

2101 CACCGACCTC GCCCTATACT TTGGGAACAG GAAGCAGTTG GAGGAGATGT ACCAGACAGG 

GTGGCTGGAG CGGGATATGA AACCCTTGTC CTTCGTCAAC CTCCTCTACA TGGTCTGTCC 

2161 GTCGCTGAAC CTCCACAACC AGTCCCATCG AGACCGTGTC ATCGGCTTGA TGATGACTGC 

CAGCGACTTG GAGGTGTTGG TCAGGGTAGC TCTGGCACAG TAGCCGAACT ACTACTGACG 

2221 CTGTGATCTT TGCTCTGTGA CCAAACTATG GCCAGTTACA AAATTGACAG CGAAT GAT AT 

GACACTAGAA ACGAGACACT GGTTTGATAC CGGTCAATGT TTTAACTGTC GCTTACTATA 

2281 ATATGCAGAA TTCTGGGCTG AGGGTGATGA GATGAAGAAG CTGGGCATAC AGCCCATTCC 

TATACGTCTT AAGACCCGAC TCCCACTACT CTACTTCTTC GACCCGTATG TCGGGTAAGG 

2341 TATGATGGAC AGAGACAAGC GAGATGAAGT CCCTCAAGGG CAGCTCGGAT TCTACAATGC 

ATACTACCTG TCTCTGTTCG CTCTACTTCA GGGAGT TCCC GTCGAGCCTA AGATGTTACG 

24 01 TGTGGCCATT CCCTGCTATA CCACCTTGAC GCAGATCCTC CCACCCACAG AGCCTCTGCT 

ACACCGGTAA GGGACGATAT GGTGGAACTG CGTCTAGGAG GGTGGGTGTC TCGGAGACGA 

24 61 GAAGGCCTGC AGGGATAACC TCAATCAGTG GGAGAAGGTA ATTCGCGGGG AAGAGACAGC 

CTTCCGGACG TCCCTATTGG AGTTAGTCAC CCTCTTCCAT TAAGCGCCCC TTCTCTGTCG 

2521 AATGTGGATT TCAGGCCCAG GCCCGGCGCC TAGCAAGAGC ACACCTGAGA AGCTGAACGT 

TTACACCTAA AGTCCGGGTC CGGGCCGCGG ATCGTTCTCG TGTGGACTCT TCGACTTGCA 

2581 GAAGGTTGAA GACTGATCCT GAAGTGACGT CCTGATGTCT GCCCAGCAAC CGACTCAACC 

CTTCCAACTT CTGACTAGGA CTTCACTGCA GGACTACAGA CGGGTCGTTG GCTGAGTTGG 

2 641 TGCTTCTGTG ACTTCGTTCT TTTTGTTTTC AAGGGGTGAA AACCCCCTGT CAGAAGGTAC 

ACGAAGACAC TGAAGCAAGA AAAACAAAAG TTCCCCACTT TTGGGGGACA GTCTTCCATG 

27 01 CGTCGCATAT CCATGTGAAG CAGACGACTC CCTGCTTGCC GCACACACCT CGGACAGTGA 

GCAGCGTATA GGTACAC TTC GTCTGCTGAG GGACGAACGG CGTGTGTGGA GCCTGTCACT 
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27 61 GCAACCCAGG CTCTGCCGTG TTCAGACGTC GGCTACTCCG TGGCTCCACC TGACCTCCGA 

CGTTGGGTCC GAGACGGCAC AAGTCTGCAG CCGATGAGGC ACCGAGGTGG ACTGGAGGCT 
~2 821 "atgctatttg CTCCCAGGCC AGCACTGCAC TGTCTGGAGG GGGCAGAGAC CACAGGAGAG 
TACGATAAAC GAGGGTCCGG TCGTGACGTG ACAGACCTCC CCCGTCTCTG GTGTCCTCTC 

28 8r~CTTC^GCCT~GCATCCTCCc" ATGAGGGTGT GGCCAGTTCC CTAGTTCTGT GCCATGCTGC 

CAAGAACGGA CGTAGGAGGG TACTCCCACA CCGGTCAAGG GATCAAGACA CGGTACGACG 
2941 TGCTTGGTGG CATTGGTTAG GAATGGGACA CACGCCCCTT GTTGTGAAGT TTACATGTGA 

ACGAACCACC GTAACCAATC CTTACCCTGT GTGCGGGGAA CAACACTTCA AATGTACACT 
3001 CCTTCTTATA GGTTAACTGA GTTTGTGGCC TGGGACACAT GTAATGAAGG TCACAGTCCA 

GGAAGAATAT CCAATTG ACT CAAACACCGG ACCCTGTGTA CATTACTTCC AGTGTCAGGT 

'"306i~" c~a"gg¥gacag"agaaatccaa"actgttgatt ACAGGTGCAC TACAGGTATG CTCTTTCAGT 
GTCCACTGTC TCTTTAGGTT TGACAACTAA TGTCCACGTG ATGTCCATAC GAGA AAGTCA 



3121 CTATCTGGGG GCACATAGGT GAGTCTGCTC CACTCAGAAG GAAGCATACC TCTSCCCTCA 

GATAGACCCC CGTGTATCCA CTCAGACGAG GTGAGTCTTC CTTCGTATGG AGASGGGAGT 
3181 ~ TCCAGGGGAC ACAGGGTACA TCCCAGGCAT CGGGGAACTG AAGCTCTCAC TTCAAACCAT 

AGGTCCCCTG TGTCCCATGT AGGGTCCGTA GCCCCTTGAC TTCGAGAGTG AAGTTTGGTA 
3241 Vtcaaagaat TAAAACACCT CCCCTCCCCC TCACTGTAGC CTTCGGCAAC TGCGCCAATC 

CAGTTTCTTA ATTTTGTGGA GGGGAGGGGG AGTGACATCG GAAGCCGTTG ACGCGGTTAG 
3301 CCTTTATACA AA^AAAATAt"a^GTAAGGCA TATAAATTTC CTCCAGCAAG CAAATCTTGT 

GGAAATATGT TTCTTTTATA TTCATTCCG^T ATATTTAAAG GAGGTCGTTC GTTTAGAACA_ 
" 3361 GGGTAAAAAA AAAAAATGTG AATTTTAACA ACCTCTATAT TTTCACTGTA TGTTATGGCA 

CCCATTTTTT TTTTTTACAC TTAAAATTGT TGGAGATATA AAAGTGACAT ACAATACCGT 
"342i""gaaVt¥tagt" ca^tc^a^"a^ ATTCCAGAAG ATACCTCATC CTATGCCTGA 

CTTAAAATCA GTGCAGGTTT TGTTTTCTAA TAAGGTCTTC TATGGAGTAG GATACGGACT 
"34 81~ AAGCTCCACA _ GCATGGCGTC CGTCTCCCAg'gGTTCTGATC CGTCTCCTCA CGGTGCAATC 

TTCGAGGTGT CGTA CCGCAG G CAGAGGGTC CCAAGACTAG GCAGAGGAGT GCCACGTTAG_ 
" 35 41^"Xggcaggaca"ggaggaggtg CAGGGCTACC ACATTGACCC AGATGGTATC tcctctcacc 

TCCGTCCTGT CCTCCTCCAC GTCCCGATGG TGTAACTGGG TCTACCATAG AGGAGAGTGG_ 
~3601~" XtTCAGACAT~CCATAAGGAA "TGCCAAAT GC " TGTATTGAAT AGTTCTCCTG TGTGACTTTC 

TAAGTCTGTA GGTATTCCTT ACGGTTTACG ACATAACTTA TCAAGAGGAC ACACTGAAAG 
^366i" '"tagagaagcc AGGA^CCC~TGAGCCTTTC CTGGGAACTC CTAAGGAAGT CACAGGTTCA 

ATCTCTTCGG TCCTGTGGGG _ACTCGGAAAG GACCCTTGAG GATTCCTTCA GTGTCCAAGT 

372i""caccgtg^gg"aYtttca^a~tag^^^ CCCGGTTCGG TTGTTCTCAC 

GTGGCACCCC TAAAAGTCC_T ATCGTACCTC TGGTCTCTTA GGGCCAAGCC AACAAGAGTG 
" 378l"~TCGGTGAGCC TTGAGAAGGA AG AG AC T G AC CAGAAACACT CACTCAGCAC TCTGGCAGGA 
AGCCACTCGG AACTCTTCCT TCTCTGACTG GTCTTTGTGA GTGAGTCGTG AGACCGTCCT 

"38 4i" 'gcaggagaag _ atactttaag^mgaatcttt "gggatagatt TTGATACACC CAATACCATA 

CGTCCTCTTC TATGAAATTC TACTTAGAAA CCCTATCTAA AACTATGTGG GTTATGGTAT 
3901 CACACAGGAG CTTGGCATTT GCAAAGTCTA TTCAGTTTCC TTCCACACTC TGACCCACGG 
GTGTGTCCTC GAACCGTAAA CGTTTCAGAT AAGTCAAAGG AAGGTGTGAG ACTGGGTGCC 

3 96i""ttgtagcgga" GTGGGCTGAA^CACTGTAACA CTGTACATGC GATTTCCCCA TGGGCTTCTA 

AACATCGCCT CACCCGACTT GTGACATTGT GACATGTACG CTAAAGGGGT ACCCGAAGAT_ 

4 021" AAATGTCACC ATCTCCTCCC CTGCTGTGTC CTACTCCATT TACTGGTTAC AAGGTGATGT 

TTTACAGTGG TAGAGGAGGG GACGA CACAG_GATGAG^gTAA_A T gaCCAATG TTCCACTACA 
~4081 CAACAAGAGA~AGCTATCACA ACACCAGGGC TGTGCACACG TGCACACACA TGTATGCACA 
GTTGTTCTCT TCGATAGTGT TGTGGTCCCG ACACGTGTGC ACGTGTGTGT ACATACGTGT 
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4141 AGCACACAGA TGTATGTACA GCACACACAC ACACACACAC CCCAAAAGGA GAGAAAAGGA 

TCGTGTGTCT ACATACATGT CGTGTGTGTG TGTGTGTGTG GGGTTTTCCT CTCTTTTCCT 
4201 AGAAAACATT TATAAAAAGC GACAGCTACC CCCATATTCA AAAATAGTTC TTTTCCCTGT 

TCTTTTGTAA ATATTTTTCG CTGTCGATGG GGGTATAAGT TTTTATCAAG AAAAGGGACA 
42 61 AGGGAAACAG GTAGCTCTCC ATAAGGAAAT TATCATGAGT GTGTTCTCCC ATCAGTGCAC 

TCCCTTTGTC CATCGAGAGG TATTCCTTTA ATAGTACTCA CACAAGAGGG TAGTCACGTG 
4321 TTCTCCCAGG GGTGCTCACT GAAGCTGGTC CACGTCTATA AACAGGTGAC ACTGGCTGCA 

AAGAGGGTCC CCACGAGTGA CTTCGACCAG GTGCAGATAT TTGTCCACTG TGACCGACGT 
4 381 GCAAAAAGCC ATTCGATCCA CACAAATTGA TCTTCTATCA TCTTGGAATC TGAATTGCAG 

CGTTTTTCGG TAAGCTAGGT GTGTTTAACT AG AAG AT AG T AGAACCTTAG ACTTAACGTC 
44 41 GGAGGAGCAG CATGTAAGAC GACCGTTTAA TTCAGGCATT CCGAAGGCAT GAGCGCATGG 

CCTCCTCGTC GTACATTCTG CTGGCAAATT AAGTCCGTAA GGCTTCCGTA CTCGCGTACC 
4501 ATTCTGTCAC CAAGCGTATA AAAGGACCCT GGCATTGGGA AACCTATGAC GGACTGTTTT 

TAAGACAGTG GTTCGCATAT TTTCCTGGGA CCGTAACCCT TTGGATACTG CCTGACAAAA 
4561 TGCTGTAGAA GTAGGGATTT TACAGAAGTC TCCTTGGATT TGCCCTGCCT GGGGCAGTTT 

ACGACATCTT CATCCCTAAA ATGTCTTCAG AGGAACCTAA ACGGGACGGA CCCCGTCAAA 
4 621 TGCAGAGGAA CCTGCCAGAG ATTTATTGGC TGGTCAGTCT CTTGTGAAAT AGTATCATGT 

ACGTCTCCTT GGACGGTCTC TAAATAACCG ACCAGTCAGA GAACACTTTA TCATAGTACA 
4 681 GAGAAACAGT TTGTAGAAAA AAACTATACC TGGGAAGACC TTTGCAACAT TGTTCCTTCC 

CTCTTTGTCA AACATCTTTT TTTGATATGG ACCCTTCTGG AAACGTTGTA ACAAGGAAGG 
4741 ATGGGCCAAG ACTCAGTTAG GAGGCATAAA TCTGCCCGGA ATAAACTAGG CCAGGATACA 

TACCCGGTTC TGAGTCAATC CTCCGTATTT AGACGGGCCT TATTTGATCC GGTCCTATGT 
4 801 GCCATGTTTA GTTAATAATT TGGTTTTAGA ATTCACACAG GCAGGATTGG TTTTTTTGTG 

CGGTACAAAT CAATTATTAA ACCAAAATCT TAAGTGTGTC JCGTCCTAACC A AAAAAACAC 
4861 TCTTGGCAAG TGGAGCATAT TTAACATACA GGCATGGGAA TCCTGCCTCT TAGCTTTTCC 

AGAACCGTTC ACCTCGTATA AATTGTATGT CCGTACCCTT AGGACGGAGA ATCGAAAAGG 
4 921 CACCCTCTTG TCTCACCAAG TTTTTTCTCT CCAAAGGTTT CCAGGAATTT CTCATTAATG 

GTGGGAG AAC AGAGTGGTTC AAAAAAG AGA GGTTTCCAAA GGTCCT_TAAA_ GAGTAATTAC 

4 981 GCTGATGCAA ACTTAGTGAA TAATAATGAA TATAAACAAT GCTCACCTCA CCAAAAT TAT 

CGACTACGTT TGAATCACTT ATTATTACTT ATATTTGTTA CGAGTGGAGT GGTTTTAATA 
"5041 ATT AT T T GC A GTCATTTGTG ATAACACAAA TTTTATCGCA ATGGTTATTA TTTAATTTGT 

TAATAAACGT CAGTAAACAC TATTGTGTTT AAAATAGCGT TACCAATAAT AAATTAAACA 
5101 GGCCACACAC TGTGGTTATC TTTTGTTGTG GTTGTTTCTG AGAAAATGTT CTTGGATATG 

CCGGTGTGTG ACACCAATAG_ AAAACAACAC CAACAAAGAC TCTTTTACAA GAACCTATAC 
5161 TAAGTGCCAA TACCAGTGTG AAGTATTGAT CCCGGGCAGC AAAATACAGC CTAAGGTTTG 

ATTCACGGTT ATGGTCACAC TTCATAACTA GGGCCCGTCG TTTTATGTCG GATTCCAAAC 
5221 TAAACATCAA TTCTATCTCA GTTCATCAGA GGGCCTGAGA AGCTGCGGGG CAGTGTAAAG 

ATTTGTAGTT AAGATAGAGT CAAGTAGTCT CCCGGACTCT TCGACGCCCC GTCACATTTC 
5281 TAAAGTATGC TGGGCTGGTG GTGGTCAGCC TCCCCTTGCC AAGAAGAGAG CAATTGAATC 

ATTTCATACG ACCCGACCAC CACCAGTCGG AGGGGAACGG TTCTTCTCTC GTTAACTTAG 
5341 CTGTCCCCAG CTCCCTCCAC GCCTGAAGAG TGACCAGTGC TGGCCCGACG GATCGCTGAG 

GACAGGGGTC GAGGGAGGTG CGGACTTCTC ACTGGTCACG ACCGGGCTGC CTAGCGACTC 

5 4 01 " ATATTCTCCC ATAATGGCAA AAAAATAGGC AGTTTGATGT GACCTGTTTA GTGTGGCTCT 

TATAAGAGGG TATTACCGTT TTTTTATCCG TCAAACTACA CTGGACAAAT CACACCGAGA 
5461 CCTCTTTTGA GCATGTGTTA GCATTTTTAT TTTATACTCA TCCAGTGAAC TCTGCTCTTC 
GGAGAAAACT CGTACACAAT CGTAAAAATA AAATATGAGT AGGTCACTTG AG AC GAG AAG 
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5521 CAAGTGTGTT CATGTATGTG CTAGATATAT TAGCACAGCC TGCCTTCTGC TGCACAACGC 

GTTCACACAA GTACAT ACAC GATCTATATA ATCGTGTCGG ACGGAAGACG ACGTGTTGCG 
5581 CTTAGAGACC CGGCCTTTCA ATGAGCTTAG CTTGTGCTCT GTTTCTGCTC TCTTAGGTCT 

GAATCTCTGG GCCGGAAAGT TACTCGAATC GAACACGAGA CAAAGACGAG AGAATCCAGA 
5641 AAACTATGGT GTCAGTTTTA ATAGAACAAA AGTATGCATC TTGCCTTGGC TTGAGCCTTT 

TTTGATAC C A CAGTCAAAAT TATCTTGTTT TCATACGTAG AACGGAACCG AACTCGGAAA 
5701 TCGTTTTCAA TGCTGACTTC TCCCCTTTCT CTCCTGTGCT CACCTTACCT TTCCAGAGTG 

AG CAAAAGTT ACGACT GAAG AGGGGAAAGA GAGGACACGA GTGGAATGGA AAGGTCTCAC 

57 61 TAAGGGACAA CTTTTAAGGA GGCGTGTCCC TGGTAGGGGC ATCCCTGTTC ACCAGGTGCC 

ATTCCCTGTT GAAAATTCCT CCGCACAGGG ACCATCCCCG TAGGGACAAG TGGTCCACGG 

5 8 21 TGTCATCACC CCACTTGACT GACATCTACC CTGGTGACTA TGGGTTCCTC TTGTTTGTAG 

ACAGTAGTGG GGTGAACTGA CTGTAGATGG GACCACT GAT ACCCAAGGAG AACAAACATC 
5881 GGAACGGTGG - CTCCAGGTGG AGGCATCAAT CTGTTGGGTT CTGGTTCCCG GCTGCCTTTG 

CCTTGCCACC GAGGTCCACC TCCGTAGTTA GACAACCCAA GACCAAGGGC CGACGGAAAC 
5941 ~GTTTTGAAAG TCTCTTCTCT GTATATTCCT ACCCTGCATT TGCTTTGTGT GGTGCTGATG 

CAAAACTTTC AGAGAAGAGA CATATAAGGA TGGGACGTAA AC G AAAC AC A CCACGACTAC 
6001 CTGTGGCAGT AGGATCTTGG ATGACTCTCC ATCAGTCACA GACTCCCCCT GTTGCAAAGT 

GACACCGTCA TCCTAGAACC TACTGAGAGG TAGTCAGTGT CTGAGGGGGA CAACGTTTCA 
60 61 "gtcaggctgaTctcgacagtc ACCGTAAAAT CTGAGTCAGT CACACACAGG CTGTCAGCCA 

CAGTCCGACT GAGCTGTCAG TGGCATTTTA GACTCAGTCA GTGTGTGTCC GACAGTCGGT 
6121 "CGGCTTCCAC TTGCATGGCT ATTCTATTTT CACACGTGAG TTTCTGTTGC TGGCTGGCTG 

GCCGAAGGTG AACGTACCGA TAAGATAAAA GTGTGCACTC AAAGACAACG ACCGACCGAC 
6181 ACTGGCATTA TCTATGCTAA GTTGAAATCA GGAGTGTGCC CAGCAGAGCC CATCATTCTC 

TGACCGTAAT AGATACGATT CAACTTTAGT CCTCACACGG GTCGTCTCGG GTAGTAAGAG 
62 4 1~ " actgtctttg" AAACAAAGCT GTACGGTTTG ATCGATGAAC GTATTTAAAG CATTTCATGC 

TGACAGAAAC TTTGTTTCGA CATGCCAAAC TAGCTACTTG CATAAATTTC GTAAAGTACG 
~630l AATG^AAAG TGCTCAGTAG TGGAAGGCAG GCTGTGACCA GTCTGCCTGC TCCTTACTAT 

TTACTGTTTC ACGAGTCATC ACCTTCCGTC CGACACTGGT CAGACGGACG AGGAATGATA 
6361 AATTGTGAGG~ATTTGTTACT GGAACAGTAC ATGGAGGCCT GACCTTGTGG GGGCACAGGG 

TTAACACTCC TAAACAATGA CCTTGTCATG TACCTCCGGA CTGGAACACC CCCGTGTCCC 
6421 TGGAACCTTA "~GCTGAATATA _ G~TGTGTGTCT CAAGAGGAAG TCAGGGTACT AGCTCAGTGC 

ACCTTG GAAT CGACTTATAT CACA CACAGA GTTCTCCTTC AGTCCCATGA TCGAGTCACG 
6481 TCAATCTCCA GGTACTATAT ATACATTTGC CCGTTTTATC TCTAATGTGA AATAAATCCC 

AG TTAGAGGT CCATGATAT A TATGTAAACG GGCAAAATAG AGATTACACT TTATTTAGGG 
6541 CAAACACTTG TTTATCGTGT AGCGTACCTA AAAGACT AT T CTATTATGGG TGTCCCCACT 

GTTTGTGAAC AAATAGCACA TCGCATGGAT TTTCTGATAA GATAATACCC ACAGGGGTGA 
6601 " TTCTTGGTTT GGTCACCCCG ATCCCCCGGT CTTCTGCTGT ATCTAGAACA GTGACTATAA 

AAG AAC C AAA CCAGTGGGGC TAGGGGGCCA GAAG AC G AC A TAGATCTTGT CACTGATATT 
6661 ATGATGTATG GGAATAGTGT TTCCATATGA TCTGTTGTCT GGAGTATATG CTACATGTTC 

TACTACATAC CCTTATCACA AAGGTATACT AGACAACAGA CCTCATATAC GAT GT AC AAG 
67 2 1 ATTTACTGTA CAAAAACCCA GTGCAGCTGA TGATGCAAAG CAGTCTCTCT CTGTGTACAG 

TAAATGACAT GTTTTTGGGT CACGTCGACT ACTACGTTTC GTCAGAGAGA GACACATGTC 
6781 TGCCCCACCT ATTTAAAAAT CACGTACAAN CCCAGAACAC TGTGAAACAC TTAACATAAG 

ACGGGGTGGA TAAATTTTTA GTGCATGTTN GGGTCTTGTG ACACTTTGTG AATTGTATTC 
"684~1 AAACAAACGC AGCGTCTGGA TTCTTTCCAA GGAGAGCAGC TTTCTCCACA GGAACACAGT 

TTTGTTTGCG TCGCAGACCT AAGAAAGGTT CCTCTCGTCG AAAGAGGTGT CCTTGTGTCA 
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6901 AACAAAAGAG GTCCGCCGCC ATCCACACCC AGCCAAGACA CCTCAGAGGC CATAGGGACA 

TTGTTTTCTC CAGGCGGCGG TAGGTGTGGG TCGGTTCTGT GGAGTCTCCG GTATCCCTGT 
6961 ACCTCCTTGC TGGCCAACAC CTGCTGGAGC AGGGCACAGG TCCCAGCAAC TGATCCTCAG 

TGGA GGAACG ACCGGTTGTG G ACGAC CTCG TCCCGTGTCC AGGGTCGTTG ACTAGGAGTC 

7021 TGGATGGGTC CGCAGTCAAA GCCTTAATGG GCTCTCTTTT GAAGG GGAAA GAAANNTTTC 

ACCTACCCAGGCGTCAGTTT CGGAATTACC CGAGAGAAAA CTTCCCCTTT CTTTNNAAAG 
7081 AAGCTTATGA TATCCAACAT TATTATAGTT GATGAGTTAG TAAATTCCGA AAAAAAAAGA 

TTCGAATACT ATAGGTTGTA ATAATATCAA CTACTCAATC ATTTAAGGCT TTTTTTTTCT 
7141 TGATTTTATA TGTATGACAT AAAAAAAATC TTTGTAAAGT GCGCAAGTGC AATAATTTAA 

_ ACTAAAATAT ACATACTGTA TTTTTTTTAG AAACATTTCA CGCGTTCACG TTATTAAATT 

72 01 AGAGGTCTTA TCTTTGCATT TATAAATTAT AAATATTGTA CATGTGTGTA ATTTTTCATG 

TCTC CAGA AT A G AAACGTAA ATAT TTAATA TTTATAACAT GTACACACAT TAAAAAGTAC 

7 261 TATTCATTTG CAGTCTTTGT ATTTAAAAAA ACTTTACTGT TATGTTTGTA TAATAGAACA 

ATAAGTAAAC GTCAG AAACA TAAATTTTTT TGAAATGACA ATACAAACAT ATTATCTTGT 

7321 TTAATCATTT ATTATAACTC AGACAAGGTG TAAATAAATT CATAATTCAA ACAGCCAGTA 

AATTAGTAAA TAATATTGAG TCTGTTCCAC ATTTATTTAA GTATTAAGTT TGTCGGTCAT 
7 381 TATATGCATA TATGGGTGTT ACATTGCAAA AATCTCTATC TTTGTTCTAT TCACATGCTT 

ATATACGTAT ATACCC ACAA T GTAACGTTT TTAGAGATAG AAACAAGATA AGTGTACGAA 

7 4 41 AAAGAAGTAA GAAATCTTTT GTGGATATGT AATTATACAT ATAAAGTATA TATATATGTA 

T TTCTTCATT CTTTAGAAAA CACCTA TACA TTAATATGTA TATTTCATAT ATATATACAT 

7 501 TGATACATGA AAT AT AT T T A GAAATGTTCA TAATTTTAAT GGATATTCTT TGGTGTGAAT~ 

ACTATGTAC T TTATA T AAAT CTTTACAAGT ATTAAAATTA CCTATAAGAA ACCACACTTA 
7 5 61 AATTGAATAC AACATTTTTA AAATGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

TTAACTTATG TTGTAAAAAT TTTACTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTT 
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SEQUENCE LISTING 

<aiO> ROBERTSON, Harold 

DENOVAN- WRIGHT, Eileen 
NOVANEURON , INC . 

<120> GENE NECESSARY FOR STRIATAL FUNCTION, USES THEREOF, AND 
COMPOUNDS FOR MODULATING SAME 

<130> 36541-0005 

<140> 
<141> 

<150> US60/158, 043 
<151> 1999-10-07 

<150> US60/217 , 765 
<151> 2000-07-12 

<160> 12 

<170> Patentln Ver . 2 

<210> 1 
<211> 3236 
<212> DNA 
<213> mouse 

<400> 1 

cactgaagct ggtccacgtc 
tccacacaaa ttgatcttct 
agacgaccgt ttaattcagg 
tataaaagga ccctggcatt 
attttacaga agtctccttg 
agagatttat tggctggtca 
aaaaaaacta tacctgggaa 
ttaggaggca taaatctgcc 



tataaacagg tgacactggc tgcagcaaaa agccattcga 60 
atcatcttgg aatctgaatt gcagggagga gcagtatgta 120 
cattccgaag gcatgagcgc atggattctg tcaccaagcg 180 
gggaaaccta tgacggactg tttttgctgt agaagtaggg 240 
aatttgccct gcctggggca gttttgcaga ggaacctgcc 300 
gtctcttgtg aaatagtatc atgtgagaaa cagtttgtag 360 
gacctttgca acattgttcc ttccatgggc caagactcag 420 
cggaataaac taggccagga tacagccatg tttagttaat 480 
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aatttggttt tagaattcac acaggcagga 
atatttaaca tacaggcatg ggaatcctgc 
caagtttttt ctctccaaag gtttccagga 
tgaataataa tgaatataaa caatgctcac 
tgtgataaca caaattttat cgcaatggtt 
tatcttttgt tgtggttgtt tctgagaaaa 
tgtgaagtat tgatcccggg cagcaaaata 
ctcagttcat cagagggcct gagaagctgc 
ggtggtggtc agcctcccgc ctgaagagtg 
attctcccat aatggcaaaa aaataggcag 
tcttttgagc atgtgttagc atttttattt 
agtgtgttca tgtatgtgct agatatatta 
tagagacccg gcctttcaat gagcttagct 
actatggtgt cagttttaat agaacaaaag 
gttttcaatg ctgacttctc ccctttctct 
agggacaact tttaaggagg cgtgtccctg 
tcatcacccc acttgactga catctaccct 
aacggtggct ccaggtggag gcatcaatct 
tttgaaagtc tcttctctgt atattcctac 
gtgcgcagta ggattcttgg atgactctcc 
gtcaggctga ctcgacagtc accgtaaaat 
cggcttccac ttgcatggct attctatttt 
actggcatta tctatgctaa gttgaaatca 
tgtctttgaa acaaagctgt acggtttgat 
tgacaaagtg ctcagtagtg gaaggcaggc 
ttgtgaggat ttgttactgg aacagtacat 
gaaccttagc tgaatatagt gtgtgtctca 
aatctccagg tactatatat acatttgccc 
aacacttgtt tatcgtgtag cgtacctaaa 
cttggtttgg tcaccccgat cccccggtct 
gatgtatggg aatagtgttt ccatatgatc 
ttactgtaca aaaacccagt gcagctgatg 
ccccacctat ttaaaaatca cgtacaascc 
caaacgcagc gtctggattc tttccaagga 
aaaagaggtc cgccgccatc cacacccagc 
tccttgctgg ccaacacctg ctggagcagg 
gatgggtccg cagtcaaagc cttaatgggc 
agcttatgat atccaacatt attatagttg 
gattttatat gtatgacata aaaaaaatct 



ttggtttttt tgtgtcttgg caagtggagc 54 0 
ctcttagctt ttcccaccct cttgtctcac 600 
atttctcatt aatggctgat gcaaacttag 660 
ctcaccaaaa ttatattatt tgcagtcatt 720 
attatttaat ttgtggccac acactgtggt 780 
tgttcttgga tatgtaagtg ccaataccag 840 
cagcctaagg tttgtaaaca tcaattctat 900 
ggggcagtgt aaagtaaagt atgctgggct 96 0 
accagtgctg gcccgacgga tcgctgagat 1020 
tttgatgtga cctgtttagt gtggctctcc 1080 
tatactcatc cagtgaactc tgctcttcca 1140 
gcacagcctg ccttctgctg cacaacgcct 1200 
tgtgctctgt ttctgctctc ttaggtctaa 1260 
tatgcatctt gccttggctt gagccttttc 1320 
cctgtgctca ccttaccttt ccagagtgta 1380 
gtaggggcat ccctgttcac caggtgcctg 144 0 
ggtgactatg ggttcctctt gtttgtaggg 15 0 0 
gttgggttct ggttcccggc tgcctttggt 1560 
cctgcatttg ctttgtgtgg tgctgatgct 1620 
atcagtcaca gactccccct gttgcaaagt 1680 
ctgagtcagt cacacacagg ctgtcagcca 174 0 
cacacgtgag tttctgttgc tggctggctg 1800 
ggagtgccca gcagagccca tcattctcac 1860 
cgatgaacgt atttaaagca tttcatgcaa 1920 
tgtgaccagt ctgcctgctc cttactataa 1980 
ggaggcctga ccttgtgggg gcacagggtg 2 04 0 
agaggaagtc agggtactag ctcagtgctc 2100 
gttttatctc taatgtgaaa taaatcccca 2160 
agactattct attatgggtg tccccacttt 2220 
tctgctgtat ctagaacagt gactataaat 2280 
tgttgtctgg agtatatgct acatgttcaa 2340 
atgcaaagca gtctctctct gtgtacagtg 2400 
cagaacactg tgaaacactt aacataagaa 2460 
gagcagcttt ctccacagga acacagtaac 252 0 
caagacacct cagaggccat agggacaacc 2580 
ggcacaggtc ccagcaactg atcctcagtg 2640 
tctcttttga aggggaaaga aagaatttca 2700 
atgagttagt aaattccaaa aaaaaaagat 2 760 
ttgtaaagtg cgcaagtgca ataatttaaa 2 82 0 



73 



gaggtcttat ctttgcattt 
attcatttgc agtctttgta 
taatcattta ttataactca 
atatgcatat atgggtgtta 
aagaagtaag aaatcttttg 
gatacatgaa atatatttag 
attgaataca acatttttaa 

<210> 2 

<211> 5752 

<212> DNA 

<213> mouse 

<400> 2 

aagtgtaaat aaaataaaca tctaataaaa aaaattacat accatagagg aacaagataa 60 
tttctgccca acttcatacc ctccagcgta tagtgttgag gtttggtctg ttgctgtgta 120 
ttgtaatgta atgttaaatt ctctacctga aggtctaggc ctacaagtga attctcatgt 180 
ttatagagtt ttgttgtgca aaccttgttc cttaatttaa aactatggtt aaaaaacaaa 240 
acaaaactgg ctacagccaa taactgaagg gggttacctt gttgaagggg tggaaaagag 3 00 
agaggaggaa gaagggagtt caagagaagg agaagaacaa gaggagagga ggaagctgcc 3 60 
acgaggggag atgggccatg agaacttggc caggagaaat agccagtatc tggagtacac 42 0 
cactgaggag gtagccaggc tagcagttag aagagtagat taggggttat ttttccccca 4 80 
ctccacatag ttatcaaagc caaataaaat aaccatagtc tgagtctcat ctatttgtaa 540 
gctagttggg tataagatta atttggctgt actacagttt agatttctaa cataggaact 600 
atcaaaaact tgctcaaaca agaacatgct gacaatattt taaaatgatt atttatattg 660 
tttgcacttt ctaaagtttc ttctaaatgt tccatggtca aattaaaaaa tatacatatt 72 0 
ggctattaaa ttcgtctaag tggggctgga gagatagctc agaggttaag agcactgact 78 0 
gctcttccag aggtcctgag ttcaattccc agcgaccaca tggtggctca cagccatctg 840 
taatagatag gatctgacgc cctcttctgg agtgtctgaa gacagctaca atgtactcat 900 
atatattaaa taaataatat tagaaaattc ttctaagtgt atcatttata gaatatttaa 960 
tatataaagt aaatgcctca ggaaatataa acttggaatt aaatcaaaga acttcatgag 1020 
tagtgggcca caaaaaatgt gtaccagggg aagaccggag ggaggggaga aggaagggat 1080 
ggagatagaa ttttgcctct gcattccttg ggctggcaca ggtataatgc tgtgggaatt 114 0 
gggaaactac aaggaagctg caaagctggg cggaactcgt ttccgcaagc tgggctcatc 12 0 0 
taagtgtcca tgcatggctg ccacactgca gtgaacttta aaacatttgt gttccagaga 1260 
tgtagagatg ctcacaatag tacaaaggcg ggagggaggt atttccagac taagaggaag 13 2 0 
aaaaaccatt gctgattaaa catctgcata tgagcgcccc cacctccata cacacacaca 13 80 
cacacacaca cacacacaca caaccaaaca gaacaaatac acatgcatgt ctacagcctg 1440 
caggaacaaa atggtatgtc tgtgaggaac caggagatgc acaggtccta acctctgtct 1500 



ataaattata 


aatattgtac 


atgtgtgtaa 


tttttcatgt 


2880 


tttaaaaaaa 


ctttactgtt 


atgtttgtat 


aatagaacat 


2940 


gacaaggtgt 


aaataaattc 


ataattcaaa 


cagccagtat 


3000 


cattgcaaaa 


atctctatct 


ttgttctatt 


cacatgctta 


3060 


tggatatgta 


attatacata 


taaagtatat 


atatatgtat 


3120 


aaatgttcat 


aattttaatg 


gatattcttt 


ggtgtgaata 


3180 


aatgaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaa 


3236 
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cctacaagcc ctgaagtctg gtcagggtca 
tagtgaaaga tttttttctt caactctagg 
ctcaggagga aacattcaga caactgatgc 
gtagttagtt gtgctcatgt atgtgctaga 
aacgccttag agacccggcc tttcaatgag 
ggtctaaact atggtgtcag ttttaataga 
ccttttcgtt ttcaatgctg acttctcccc 
gagtgtaagg gacaactttt aaggaggcgt 
gtgcctgtca tcaccccact tgactgacat 
tgtagggaac ggtggctcca ggtggaggca 
ctttggtttt gaaagtctct tctctgtata 
tgatgctgtg cgcagcagga ttcttggatg 
gcaaagtgtc aggctgactc gacagtcacc 
tcagccacgg cttccacttg catggctatt 
ctggctgact ggcattatct atgctaagtt 
ttctcactgt ctttgaaaca aagctgtacg 
catgcaatga caaagtgctc agtagtggaa 
actataattg tgaggatttg ttactggaac 
cagggtggaa ccttagctga atatagtgtg 
agtgctcaat ctccaggtac tatatataca 
atccccaaac acttgtttat cgtgtagcgt 
ccactttctt ggtttggtca ccccgatccc 
tataaatgat gtatgggaat agtgtttcca 
tgttcattta ctgtacaaaa acccagtgca 
tacagtgccc cacctattta aaaatcacgt 
ataagaacaa acgcagcgtc tggattcttt 
cagtaacaaa agaggtccgc cgccatccac 
gacaacctcc ttgctggcca acacctgctg 
ctcagtggat gggtctgcag ccaaagcctt 
aatttcaagc ttatgatatc caatattatt 
aaaagatgat tttatatgta tgacataaaa 
atttaaagag gtcttatctt tgcatttata 
ttcatgtatt catttgcagt ctttgtattt 
agaacattaa tcatttatta taactcagac 
ccagtatata tgcatatatg ggtgttacat 
atgcttaaag aagtaagaaa tcttttgtgg 
tatgtatgat acatgaaata tatttagaaa 
gtgaataatt gaatacaaca tttttaaaat 
aaaatttttt tttttttttt ttattccaga 



aatgtacaaa agcaggctaa ggaagctgtt 1560 
aacaacctat ttcctaggat ttggagagtg 1620 
tctctgtgta ccccagattc aggtattggg 1680 
tatattagca cagcctgcct tctgctgcac 1740 
cttagcttgt gctctgtttc tgctctctta 1800 
acaaaagtat gcatcttgcc ttggcttgag 1860 
tttctctcct gtgctcacct tacctttcca 1920 
gtccctggta ggggcatccc tgttcaccag 1980 
ctaccctggt gactatgggt tcctcttgtt 2040 
tcaatctgtt gggttctggt tcccggctgc 2100 
ttcctaccct gcatttgctt tgtgtggtgc 2160 
actctccatc agtcacagac tccccctgtt 2220 
gtaaaatctg agtcagtcac acacaggctg 2280 
ctattttcac acgtgagttt ctgttgctgg 2340 
gaaatcaggg gtgcccagca gagcccatca 2400 
gtttgatcga tgaacgtatt taaagcattt 2460 
ggcaggctgt gaccagtctg cctgctcctt 2520 
agtacatgga ggcctgacct tgtgggggca 2 58 0 
tgtctcaaga ggaagtcagg gtactagctc 2 64 0 
tttgcccgtt ttatctctaa tgtgaaataa 2700 
acctaaaaga ctattctatt atgggtgtcc 2760 
ccggtcttct gctgtatcta gaacagtgac 2820 
tatgatctgt tgtctggagt atatgctaca 2 880 
gctgatgatg caaagcagtc tctctctgtg 2940 
acttgcccag aacactgtga aacacttaac 3000 
ccaaggagag cagctttctc cacaggaaca 3 060 
acccagccaa gacacctcag aggccatagg 3120 
gagcaggggc acaggtccca gcaactgatc 318 0 
aatgggctct cttttgaagg ggaaagaaag 3240 
atagttgatg agttagtaaa ttccaaaaaa 3300 
aaaatctttg taaagtgcgc aagtgcaata 3360 
aattataaat attgtacatg tgtgtaattt 3420 
aaaaaaactt tactgttatg tttgtataat 3480 
aaggtgtaaa taaattcata attcaaacag 3540 
tgcaaaaatc tctatctttg ttctattcac 3600 
atatgtaatt atacatataa agtatatata 3660 
tgttcataat tttaatggat attctttggt 372 0 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 37 80 
gattaaagac actagatctt taaccttgaa 3840 
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gggcaggcaa gaggtcggca atgctgtcaa catagaagtc agggaccatt ttcttcttga 3 90 0 
acatgcagtc actttcctga ttgctcttca catcctcaag gctccggaat tccgggggtg 3960 
tggtgggctt tgatctcagg actctggagg cagaagcagg cagatctctg tgaatatgag 4 02 0 
gccagcctgc actacacaga gctccagacc agtcatggct acatcatgaa accctgtctc 4080 
aaaaagaaaa taaaaactgt tgtgtttcta ccatagtgtt aaactcagag tctgagtaat 414 0 
gtcgggctga catgctcggg tgtttaacat accttcagct ttgacgaggc gctgaacagt 4200 
caaagtctgg ccttggggag cggtggctgt gtttgtgctc aagtccaccg tgaaatcctg 4260 
attgtgaatt tggacaaccg tgtccttctt cttggccttc catgcaacct ccaacttcat 4320 
gttggtcatt ttgtcaaaac actgtgtgat gtttttatca atatactgcc attccacata 4380 
tgtagagatg tagtctgcct ggctttcctt ttctttagcc aatcgaatgc tcttgatcat 4440 
gccctcaatc tcatctctag cttttatcac gtctctgcta attcctgaaa cttgaatcga 4500 
agttttcttc tggttcatct caatggtgat gttcagttcc ttctgaatct cattcagttt 4560 
ctcgtactcc tccatgtcaa agtcactgac acactcatcg tcattggtgt aggaaagctg 462 0 
ctctttggta atcagttcct ttagccagga gattgttttg ttcacactgt ctacccctga 4680 
accacatacc tggaaaactg tgtgctctat tttcttttcc aaaaccaggg tgttcttttt 4740 
gggggaagct tgcttgggaa agccaagaaa ggctaaagag aaaatggaaa ttaatgtttc 48 0 0 
ttttactccc ttcaacatca aggttaggaa tatgtatttc ataaaagcta acaactcaca 4860 
ggcaatctta gacatcactg actgcttggc aggcgactgc ttggggggag ctggagagcc 4 92 0 
ttctctttct ttcatgttgt cgtaaaaaaa ttgcagaata tggggctgga agataacaac 4980 
tttaactctc ttcacagcct gcactgattt tttctggaca aattcttcaa tggcatctat 5040 
tatcgctttt gctactacgt ttgggtcctg ttgagcattt ccttcaaaaa caaaaaaagc 5100 
acatttttaa aaagtcaagg ttaagatcca cctgcaaaaa aaagctgcaa tataagcgag 5160 
gaattctagt tgtcacagga aataaaaatg tctgttccca ctataatcaa tgtagactga 522 0 
taatattatg ccagcaaata gttttgaagt cctaggcaca gtgggaggag gttttgttcc 5280 
acgctgttca taagccaata ccccagcaaa agaccttaaa ggacaacttg taatttggga 5340 
cattcacatc tgtcctcttc atctgatctg gctcccagtg tcactctcta acacggtcct 5400 
tagagggaca atttatccct gcctctgctt gatcttatgc atgtatctgt attcttccag 5460 
ccatccctgg cgacctgatt tttctaaggc acccaaaact gtaagctact tcttataatc 552 0 
tataattctg agcatattag ttagcctgag cctccaggat atctttcttc cctatactca 5580 
gtccagtttt agctgcccag aaggattcaa agctgatcta cgagtagatc actcctgtct 5640 
acagcttgtt ccagatcttg tttctcaagc cctggaagcc atcagccagg taagattgta 5700 
aaacaatccc tttctaatca tgggtgtggc ccaaagtgaa tggccggaat tc 5752 

<210> 3 
<211> 475 
<212> DNA 
<213> mouse 

<400> 3 
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tgtatgggaa tagtgtttcc atatgatctg ttgtctggag tatatgctac atgttcattt 60 
actgtacaaa aacccagtgc agctgatgat gcaaagcagt ctctctctgt gtacagtgcc 12 0 
ccacctattt aaaaatcacg tacttgccca gaacactgtg aaacacttaa cataagaaca 180 
aacgcagcgt ctggattctt tccaaggaga gcagctttct ccacaggaac acagtaacaa 240 
aagaggtccg ccgccatcca cacccagcca agacacctca gaggccatag ggacaacctc 3 00 
cttgctggcc aacacctgct ggagcagggg cacaggtccc agcaactgat cctcagtgga 3 60 
tgggtctgca gccaaagcct taatgggctc tcttttgaag gggaaagaaa gaatttcaag 420 
cttatgatat ccaatattat tatagttgat gagttagtaa attccaaaaa aaaaa 475 



<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence :primer 



<400> 4 

agggctgtca atcatgctgg 2 0 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 



<400> 5 

aaactcacgg tcggtgcagc 2 0 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : probe 
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<400> 6 

attaaccctc actaaatgct gtat 

<210> 7 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : probe 

<400> 7 

cattatgctg agtgatatct ttttttttcg 

<210> 8 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : probe 
<400> 8 

gaacatgtag catatactcc agacaacaga tcatatgg 

<210> 9 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : probe 
<400> 9 

cagcttctcc acaggaacac agtaacaaag ag 

<210> 10 
<211> 35 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

ctatttcaca agagactgac cagccaataa atctc 35 



<210> 11 
<211> 7581 
<212> DNA 
< 2 1 3 > Unknown 



<400> 11 

cgcccgggca ggtctgttgg agggcagttg 
agaccccact gatggtgtgc tgcctttcag 
gatttgggca aagccacatt cctggagaag 
agctgctgat gaggcccagg gagtagccca 
tcctgctcca tgtggcatgg aaaaattata 
tctttctctc catccccagg tattagatga 
tgtggaaaag tggctgaaga ggaaaaccaa 
agtcagcagg taccaggata cgaatatgca 
agagcagcgc ctggacacgg gcggggacaa 
catcaggata gccacaaaag ccgacggatt 
tagcctgtgt gtgttcatac cacccgggat 
agggcccatc acccagggta ccaccatctc 
gttggtagag gatatccttg gggatgagcg 
aacccgcatc cagtctgttc tttgcttgcc 
catccttgaa ctgtacaggc actggggcaa 
tgcaacagcc aatcttgctt gggcttccgt 
tctcgccaaa cagaccgaac tgaatgactt 
taacatagtt gccatagact ctctacttga 
gaacgccgac cgctgcgcgc tcttccaggt 
cctgtttgac attggggagg agaaggaggg 
cagattttcc attgagaaag ggattgctgg 
cattcccgat gcctacgcgg accctcgctt 
caccacgagg aacattctgt gtatgcccat 
gcagatggtg aacaagatca gcggtagcgc 
gatgtttgct gtcttctgcg cactggcctt 



gtcaacctga ccagagagag ctgagctgga 60 
tccaggaaga aagaaaggaa ggattctgag 12 0 
tctgtatact gatgccaaac ccaagagctg 180 
cgcgccctga gctgttggct agcaaggcct 240 
tggtttgacg gatgaaaagg tgaaggccta 3 00 
atttgtttct gaaagtgtta gtgcagagac 360 
caaagcaaaa gatgaaccat ctcccaagga 42 0 
gggagtcgtg tacgagctga acagctacat 480 
ccacctgctc ctctatgagc tcagcagcat 540 
tgcactgtac ttccttggag agtgcaataa 600 
gaaggaaggc caaccccggc tcatccctgc 660 
tgcctacgtg gccaagtcta ggaagacgtt 72 0 
atttcctcga ggtactggcc tggaatcagg 780 
cattgtcact gccattggag acttgattgg 840 
agaggccttc tgcctcagcc atcaggaggt 90 0 
agcaatacac caggtgcagg tgtgtagagg 960 
cctactcgac gtatcaaaga catactttga 1020 
acacatcatg atatatgcaa aaaatctagt 10 80 
ggaccacaag aacaaggagc tgtactcgga 114 0 
gaagcccatc ttcaagaaga ccaaggagat 1200 
tcaagtggca agaacaggcg aagtcttgaa 1260 
taacagggag gtggacctgt acacaggcta 1320 
agtgagccga ggcagcgtga ttggcgtggt 13 8 0 
cttctccaag acagacgaga acaacttcaa 1440 
gcactgtgct aacatgtacc acaggatccg 1500 
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ccactcagaa tgcatctaca gggttaccat 
ctccgaggag tggcaaggcc tcatgcgctt 
cgagctattc cactttgaca ttggtccttt 
catgatccat cggtcttgtg ggacatcctg 
catgtctgtg aagaagaact atcggcgggt 
ggtggcacac tgcatgtatg ccatacttca 
gcgcaaaggc ctgctaattg cgtgtctgtg 
cagctacctg cagaagttcg accaccccct 
gcaacaccac ttctcccaga cggtgtccat 
caccctgagc tccagcgagt acgagcaggt 
caccgacctc gccctatact ttgggaacag 
gtcgctgaac ctccacaacc agtcccatcg 
ctgtgatctt tgctctgtga ccaaactatg 
atatgcagaa ttctgggctg agggtgatga 
tatgatggac agagacaagc gagatgaagt 
tgtggccatt ccctgctata ccaccttgac 
gaaggcctgc agggataacc tcaatcagtg 
aatgtggatt tcaggcccag gcccggcgcc 
gaaggttgaa gactgatcct gaagtgacgt 
tgcttctgtg acttcgttct ttttgttttc 
cgtcgcatat ccatgtgaag cagacgactc 
gcaacccagg ctctgccgtg ttcagacgtc 
atgctatttg ctcccaggcc agcactgcac 
gttcttgcct gcatcctccc atgagggtgt 
tgcttggtgg cattggttag gaatgggaca 
ccttcttata ggttaactga gtttgtggcc 
aggtgacaga gaaatccaaa ctgttgatta 
tatctggggg cacataggtg agtctgctcc 
ccaggggaca cagggtacat cccaggcatc 
tcaaagaatt aaaacacctc ccctccccct 
ctttatacaa agaaaataaa agtaaggcat 
ggtaaaaaaa aagcatgtga atnntaacaa 
aattttagtc acgtccaaaa caaaaagatt 
aaggctccac agcatggcgt ccgtctccca 
caggcaggac agagaggagg gctgcagggc 
caccattcag acatccataa ggaatgccaa 
ttctagagaa gccaggacac cctgagcctt 
acaccgtggg gattttcagg atagcatgga 
gagccttgag aaggagagac tgaccagaaa 



ggagaagctt tcctaccaca gcatctgcac 1560 
caacctacca gcacgcatct gccgggacat 1620 
cgagaacatg tggcctggga tctttgtcta 1680 
ttttgaactt gaaaaattgt gccgttttat 1740 
tccttaccac aactggaagc atgcagtcac 1800 
aaacaacaat ggcctcttca cagacctcga 1860 
ccatgacctg gaccacaggg gcttcagtaa 1920 
ggcggcgctg tactccacct ccaccatgga 1980 
ccttcagctg gaagggcaca atatcttctc 2 04 0 
gctggagatc atccgcaaag ccatcatcgc 2100 
gaagcagttg gaggagatgt accagacagg 2160 
agaccgtgtc atcggcttga tgatgactgc 222 0 
gccagttaca aaattgacag cgaatgatat 22 8 0 
gatgaagaag ctgggcatac agcccattcc 2340 
ccctcaaggg cagctcggat tctacaatgc 2400 
gcagatcctc ccacccacag agcctctgct 2460 
ggagaaggta attcgcgggg aagagacagc 252 0 
tagcaagagc acacctgaga agctgaacgt 2580 
cctgatgtct gcccagcaac cgactcaacc 2640 
aaggggtgaa aaccccctgt cagaaggtac 2700 
cctgcttgcc gcacacacct cggacagtga 2760 
ggctactccg tggctccacc tgacctccga 2820 
tgtctggagg gggcagagac cacaggagag 2 8 80 
ggccagttcc ctagttctgt gccatgctgc 2 940 
cacgcccctt gttgtgaagt ttacatgtga 3000 
tggacacatg taatgaaggt cacagtccac 3060 
caggtgcact acaggtatgc tctttcagtc 3120 
actcagaann aagcatacct ctgccctcat 3180 
ggggaactga agctctcact tcaaaccatg 3240 
cactgtagcc ttcgacaact gcgccaatcc 3300 
ataaatttcc tccagcaagc aaatcttgtg 3360 
cntctanant ntcncngnat gttatggcag 342 0 
attccagaag atacctcatc ctatgcctga 3480 
gggttctgat ccgtctcctc acggtgcaat 3540 
taccacattg acccagaagg tatctcctct 3600 
atgctgtatt gaatagttct ctgtgtgact 3660 
tccnggggaa ctctaaggag tcacaggttc 3 72 0 
gacagagatc cggtcgttgt tctcactcgt 3780 
cactcactca gcactctgca ggagcaggag 3 84 0 



80 



aagatacttt aagatgaatc ttggatagat 
gcttggcatt tgcaaagtct attcagtttc 
agtgggctga acactgtaac actgtacatg 
catctcctcc cctgctgtgt cctactccat 
aagctatcac aacaccaggg ctgtgcacac 
atgtatgtac agcacacaca cacacacaca 
ttataaaaag cgacagctac cccatatcaa 
tccataagga attatcatga gtgtgttctc 
tgaagctggt ccacrtctat aaacaggtga 
acacaaattg atcttctatc atcttggaat 
cgaccgttta attcaggcat tccgaaggca 
aaaaggaccc tggcattggg aaacctatga 
ttacagaagt ctccttgrat ttgccctgcc 
gatttattgg ctggtcagtc tcttgtgaaa 
aaaactatac ctgggaagac ctttgcaaca 
ggaggcataa atctgcccgg aataaactag 
ttggttttag aattcacaca ggcaggattg 
tttaacatac aggcatggga atcctgcctc 
gttttttctc tccaaaggtt tccaggaatt 
ataataatga atataaacaa tgctcacctc 
gataacacaa attttatcgc aatggttatt 
cttttgttgt ggttgtttct gagaaaatgt 
gaagtattga tcccgggcag caaaatacag 
agttcatcag agggcctgag aagctgcggg 
ggtggtcagc ctccccttgc caagaagaga 
cgcctgaaga gtgaccagtg ctggcccgac 
aaaaaatagg cagtttgatg tgacctgttt 
agcattttta ttttatactc atccagtgaa 
gctagatata ttagcacagc ctgccttctg 
aatgagctta gcttgtgctc tgtttctgct 
aatagaacaa aagtatgcat cttgccttgg 
ctcccctttc tctcctgtgc tcaccttacc 
aggcgtgtcc ctggtagggg catccctgtt 
tgacatctac cctggtgact atgggttcct 
gaggcatcaa tctgttgggt tctggttccc 
tgtatattcc taccctgcat ttgctttgtg 
gatgactctc catcagtcac agactccccc 
caccgtaaaa tctgagtcag tcacacacag 
tattctattt tcacacgtga gtttctgttg 



tttgatacac ccaataccat acacacagga 3900 
cttccgcgct ctgacccacg gttgtagcgg 3960 
cgatttcccc atgggcttct aaaatgtcac 4020 
ttactggtta caaggtgatg tcaacaagag 4080 
gtgcacacac atgtatgcac aagcacacag 4140 
ccccaaaagg agagaaaagg aagaaaacat 42 0 0 
aatagtcttt cctgtaggaa acaggagctc 4260 
ccatcagtgc actctcccag gggtgctcac 4320 
cactggctgc agcaaaaagc cattcgatcc 43 80 
ctgaattgca gggaggagca gyatgtaaga 4440 
tgagcgcatg gattctrtca ccaagcgtat 4500 
cggactgttt ttgctgtaga agtagggatt 4560 
tggggcagtt ttgcagagga acctgccaga 4 62 0 
tagtatcatg tgagaaacag tttgtagaaa 4680 
ttgttccttc catgggccaa gactcagtta 4740 
gccaggatac agccatgttt agttaataat 4800 
gtttttttgt gtcttggcaa gtggagcata 4860 
ttagcttttc ccaccctctt gtctcaccaa 4920 
tctcattaat ggctgatgca aacttagtga 4980 
accaaaatta tattatttgc agtcatttgt 5040 
atttaatttg tggccacaca ctgtggttat 5100 
tcttggatat gtaagtgcca ataccagtgt 5160 
cctaaggttt gtaaacatca attctatctc 5220 
gcagtgtaaa gtaaagtatg ctgggctggt 52 8 0 
gcaattgaat cctgtcccca gctccctcca 5340 
ggatcgctga gatattctcc cataatggca 5400 
agtgtggctc tcctcttttg agcatgtgtt 5460 
ctctgctctt ccaagtgtgt tcatgtatgt 5520 
ctgcacaacg ccttagagac ccggcctttc 5580 
ctcttaggtc taaactatgg tgtcagtttt 5640 
cttgagcctt ttcgttttca atgctgactt 5700 
tttccagagt gtaagggaca acttttaagg 5760 
caccaggtgc ctgtcatcac cccacttgac 582 0 
cttgtttgta gggaacggtg gctccaggtg 5880 
ggctgccttt ggttttgaaa gtctcttctc 5940 
tggtgctgat gctgtggcag taggatcttg 6000 
tgttgcaaag tgtcaggctg actcgacagt 6060 
gctgtcagcc acggcttcca cttgcatggc 6120 
ctggctggct gactggcatt atctatgcta 6180 
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agttgaaatc aggagtgtgc ccagcagagc ccatcattct cactgtcttt gaaacaaagc 6240 
tgtacggttt gatcgatgaa cgtatttaaa gcatttcatg caatgacaaa gtgctcagta 63 00 
gtggaaggca ggctgtgacc agtctgcctg ctccttacta taattgtgag gatttgttac 6360 
tggaacagta catggaggcc tgaccttgtg ggggcacagg gtggaacctt agctgaatat 6420 
agtgtgtgtc tcaagaggaa gtcagggtac tagctcagtg ctcaatctcc aggtactata 6480 
tatacatttg cccgttttat ctctaatgtg aaataaatcc ccaaacactt gtttatcgtg 6540 
tagcgtacct aaaagactat tctattatgg gtgtccccac tttcttggtt tggtcacccc 6600 
gatcccccgg tcttctgctg tatctagaac agtgactata aatgatgtat gggaatagtg 6660 
tttccatatg atctgttgtc tggagtatat gctacatgtt catttactgt acaaaaaccc 6720 
agtgcagctg atgatgcaaa gcagtctctc tctgtgtaca gtgccccacc tatttaaaaa 6780 
tcacgtacaa ncccagaaca ctgtgaaaca cttaacataa gaaacaaacg cagcgtctgg 6840 
attctttcca aggagagcag ctttctccac aggaacacag taacaaaaga ggtccgccgc 6900 
catccacacc cagccaagac acctcagagg ccatagggac aacctccttg ctggccaaca 6960 
cctgctggag cagggcacag gtcccagcaa ctgatcctca gtggatgggt ccgcagtcaa 7020 
agccttaatg ggctctcttt tgaaggggaa agaaannttt caagcttatg atatccaaca 7080 
ttattatagt tgatgagtta gtaaattccg aaaaaaaaag atgattttat atgtatgaca 714 0 
taaaaaaaat ctttgtaaag tgcgcaagtg caataattta aagaggtctt atctttgcat 7200 
ttataaatta taaatattgt acatgtgtgt aatttttcat gtattcattt gcagtctttg 7260 
tatttaaaaa aactttactg ttatgtttgt ataatagaac attaatcatt tattataact 7320 
cagacaaggt gtaaataaat tcataattca aacagccagt atatatgcat atatgggtgt 73 80 
tacattgcaa aaatctctat ctttgttcta ttcacatgct taaagaagta agaaatcttt 7440 
tgtggatatg taattataca tataaagtat atatatatgt atgatacatg aaatatattt 7500 
agaaatgttc ataattttaa tggatattct ttggtgtgaa taattgaata caacattttt 7560 
aaaatgaaaa aaaaaaaaaa c 75 81 



<210> 12 
<211> 7618 
<212> DNA 
<213> mouse 



<400> 12 

cgcccgggca ggtctgttgg agggcagttg gtcaacctga ccagagagag ctgagctgga 6 0 
agaccccact gatggtgtgc tgcctttcag tccaggaaga aagaaaggaa ggattctgag 12 0 
gatttgggca aagccacatt cctggagaag tctgtatact gatgccaaac ccaagagctg 180 
agctgctgat gaggcccagg gagtagccca cgcgccctga gctgttggct agcaaggcct 240 
tcctgctcca tgtggcatgg aaaaattata tggtttgacg gatgaaaagg tgaaggccta 300 
tctttctctc catccccagg tattagatga atttgtttct gaaagtgtta gtgcagagac 360 
tgtggaaaag tggctgaaga ggaaaaccaa caaagcaaaa gatgaaccat ctcccaagga 42 0 
agtcagcagg taccaggata cgaatatgca gggagtcgtg tacgagctga acagctacat 480 
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agagcagcgc ctggacacgg gcggggacaa ccacctgctc ctctatgagc tcagcagcat 540 
catcaggata gccacaaaag ccgacggatt tgcactgtac ttccttggag agtgcaataa 600 
tagcctgtgt gtgttcatac cacccgggat gaaggaaggc caaccccggc tcatccctgc 660 
agggcccatc acccagggta ccaccatctc tgcctacgtg gccaagtcta ggaagacgtt 72 0 
gttggtagag gatatccttg gggatgagcg atttcctcga ggtactggcc tggaatcagg 780 
aacccgcatc cagtctgttc tttgcttgcc cattgtcact gccattggag acttgattgg 840 
catccttgaa ctgtacaggc actggggcaa agaggccttc tgcctcagcc atcaggaggt 900 
tgcaacagcc aatcttgctt gggcttccgt agcaatacac caggtgcagg tgtgtagagg 960 
tctcgccaaa cagaccgaac tgaatgactt cctactcgac gtatcaaaga catactttga 1020 
taacatagtt gccatagact ctctacttga acacatcatg atatatgcaa aaaatctagt 1080 
gaacgccgac cgctgcgcgc tcttccaggt ggaccacaag aacaaggagc tgtactcgga 114 0 
cctgtttgac attggggagg agaaggaggg gaagcccatc ttcaagaaga ccaaggagat 12 0 0 
cagattttcc attgagaaag ggattgctgg tcaagtggca agaacaggcg aagtcttgaa 12 60 
cattcccgat gcctacgcgg accctcgctt taacagggag gtggacctgt acacaggcta 1320 
caccacgagg aacattctgt gtatgcccat agtgagccga ggcagcgtga ttggcgtggt 13 80 
gcagatggtg aacaagatca gcggtagcgc cttctccaag acagacgaga acaacttcaa 1440 
gatgtttgct gtcttctgcg cactggcctt gcactgtgct aacatgtacc acaggatccg 1500 
ccactcagaa tgcatctaca gggttaccat ggagaagctt tcctaccaca gcatctgcac 1560 
ctccgaggag tggcaaggcc tcatgcgctt caacctacca gcacgcatct gccgggacat 1620 
cgagctattc cactttgaca ttggtccttt cgagaacatg tggcctggga tctttgtcta 1680 
catgatccat cggtcttgtg ggacatcctg ttttgaactt gaaaaattgt gccgttttat 1740 
catgtctgtg aagaagaact atcggcgggt tccttaccac aactggaagc atgcagtcac 1800 
ggtggcacac tgcatgtatg ccatacttca aaacaacaat ggcctcttca cagacctcga 1860 
gcgcaaaggc ctgctaattg cgtgtctgtg ccatgacctg gaccacaggg gcttcagtaa 192 0 
cagctacctg cagaagttcg accaccccct ggcggcgctg tactccacct ccaccatgga 1980 
gcaacaccac ttctcccaga cggtgtccat ccttcagctg gaagggcaca atatcttctc 2040 
caccctgagc tccagcgagt acgagcaggt gctggagatc atccgcaaag ccatcatcgc 2100 
caccgacctc gccctatact ttgggaacag gaagcagttg gaggagatgt accagacagg 2160 
gtcgctgaac ctccacaacc agtcccatcg agaccgtgtc atcggcttga tgatgactgc 222 0 
ctgtgatctt tgctctgtga ccaaactatg gccagttaca aaattgacag cgaatgatat 2280 
atatgcagaa ttctgggctg agggtgatga gatgaagaag ctgggcatac agcccattcc 2340 
tatgatggac agagacaagc gagatgaagt ccctcaaggg cagctcggat tctacaatgc 24 0 0 
tgtggccatt ccctgctata ccaccttgac gcagatcctc ccacccacag agcctctgct 2460 
gaaggcctgc agggataacc tcaatcagtg ggagaaggta attcgcgggg aagagacagc 2520 
aatgtggatt tcaggcccag gcccggcgcc tagcaagagc acacctgaga agctgaacgt 2580 
gaaggttgaa gactgatcct gaagtgacgt cctgatgtct gcccagcaac cgactcaacc 2 640 
tgcttctgtg acttcgttct ttttgttttc aaggggtgaa aaccccctgt cagaaggtac 2 700 
cgtcgcatat ccatgtgaag cagacgactc cctgcttgcc gcacacacct cggacagtga 2 7 60 
gcaacccagg ctctgccgtg ttcagacgtc ggctactccg tggctccacc tgacctccga 2 820 
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atgctatttg ctcccaggcc agcactgcac tgtctggagg gggcagagac cacaggagag 2880 
gttcttgcct gcatcctccc atgagggtgt ggccagttcc ctagttctgt gccatgctgc 2940 
tgcttggtgg cattggttag gaatgggaca cacgcccctt gttgtgaagt ttacatgtga 3000 
ccttcttata ggttaactga gtttgtggcc tgggacacat gtaatgaagg tcacagtcca 3060 
caggtgacag agaaatccaa actgttgatt acaggtgcac tacaggtatg ctctttcagt 312 0 
ctatctgggg gcacataggt gagtctgctc cactcagaag gaagcatacc tctsccctca 3180 
tccaggggac acagggtaca tcccaggcat cggggaactg aagctctcac ttcaaaccat 3240 
gtcaaagaat taaaacacct cccctccccc tcactgtagc cttcggcaac tgcgccaatc 3300 
cctttataca aagaaaatat aagtaaggca tataaatttc ctccagcaag caaatcttgt 3360 
gggtaaaaaa aaaaaatgtg aattttaaca acctctatat tttcactgta tgttatggca 342 0 
gaattttagt cacgtccaaa acaaaagatt attccagaag atacctcatc ctatgcctga 3480 
aagctccaca gcatggcgtc cgtctcccag ggttctgatc cgtctcctca cggtgcaatc 3540 
aggcaggaca ggaggaggtg cagggctacc acattgaccc agatggtatc tcctctcacc 3 60 0 
attcagacat ccataaggaa tgccaaatgc tgtattgaat agttctcctg tgtgactttc 3660 
tagagaagcc aggacacccc tgagcctttc ctgggaactc ctaaggaagt cacaggttca 3720 
caccgtgggg attttcagga tagcatggag accagagaat cccggttcgg ttgttctcac 3780 
tcggtgagcc ttgagaagga agagactgac cagaaacact cactcagcac tctggcagga 3840 
gcaggagaag atactttaag atgaatcttt gggatagatt ttgatacacc caataccata 3 900 
cacacaggag cttggcattt gcaaagtcta ttcagtttcc ttccacactc tgacccacgg 3960 
ttgtagcgga gtgggctgaa cactgtaaca ctgtacatgc gatttcccca tgggcttcta 4020 
aaatgtcacc atctcctccc ctgctgtgtc ctactccatt tactggttac aaggtgatgt 4080 
caacaagaga agctatcaca acaccagggc tgtgcacacg tgcacacaca tgtatgcaca 4140 
agcacacaga tgtatgtaca gcacacacac acacacacac cccaaaagga gagaaaagga 4200 
agaaaacatt tataaaaagc gacagctacc cccatattca aaaatagttc ttttccctgt 4260 
agggaaacag gtagctctcc ataaggaaat tatcatgagt gtgttctccc atcagtgcac 4320 
ttctcccagg ggtgctcact gaagctggtc cacgtctata aacaggtgac actggctgca 4380 
gcaaaaagcc attcgatcca cacaaattga tcttctatca tcttggaatc tgaattgcag 4440 
ggaggagcag catgtaagac gaccgtttaa ttcaggcatt ccgaaggcat gagcgcatgg 4500 
attctgtcac caagcgtata aaaggaccct ggcattggga aacctatgac ggactgtttt 4560 
tgctgtagaa gtagggattt tacagaagtc tccttggatt tgccctgcct ggggcagttt 4 62 0 
tgcagaggaa cctgccagag atttattggc tggtcagtct cttgtgaaat agtatcatgt 4680 
gagaaacagt ttgtagaaaa aaactatacc tgggaagacc tttgcaacat tgttccttcc 4740 
atgggccaag actcagttag gaggcataaa tctgcccgga ataaactagg ccaggataca 4800 
gccatgttta gttaataatt tggttttaga attcacacag gcaggattgg tttttttgtg 4860 
tcttggcaag tggagcatat ttaacataca ggcatgggaa tcctgcctct tagcttttcc 4920 
caccctcttg tctcaccaag ttttttctct ccaaaggttt ccaggaattt ctcattaatg 4980 
gctgatgcaa acttagtgaa taataatgaa tataaacaat gctcacctca ccaaaattat 5040 
attatttgca gtcatttgtg ataacacaaa ttttatcgca atggttatta tttaatttgt 5100 
ggccacacac tgtggttatc ttttgttgtg gttgtttctg agaaaatgtt cttggatatg 5160 
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taagtgccaa taccagtgtg aagtattgat cccgggcagc aaaatacagc ctaaggtttg 522 0 
taaacatcaa ttctatctca gttcatcaga gggcctgaga agctgcgggg cagtgtaaag 5280 
taaagtatgc tgggctggtg gtggtcagcc tccccttgcc aagaagagag caattgaatc 5340 
ctgtccccag ctccctccac gcctgaagag tgaccagtgc tggcccgacg gatcgctgag 5400 
atattctccc ataatggcaa aaaaataggc agtttgatgt gacctgttta gtgtggctct 5460 
cctcttttga gcatgtgtta gcatttttat tttatactca tccagtgaac tctgctcttc 5520 
caagtgtgtt catgtatgtg ctagatatat tagcacagcc tgccttctgc tgcacaacgc 5580 
cttagagacc cggcctttca atgagcttag cttgtgctct gtttctgctc tcttaggtct 5640 
aaactatggt gtcagtttta atagaacaaa agtatgcatc ttgccttggc ttgagccttt 5700 
tcgttttcaa tgctgacttc tcccctttct ctcctgtgct caccttacct ttccagagtg 5760 
taagggacaa cttttaagga ggcgtgtccc tggtaggggc atccctgttc accaggtgcc 5820 
tgtcatcacc ccacttgact gacatctacc ctggtgacta tgggttcctc ttgtttgtag 5880 
ggaacggtgg ctccaggtgg aggcatcaat ctgttgggtt ctggttcccg gctgcctttg 5940 
gttttgaaag tctcttctct gtatattcct accctgcatt tgctttgtgt ggtgctgatg 6000 
ctgtggcagt aggatcttgg atgactctcc atcagtcaca gactccccct gttgcaaagt 6060 
gtcaggctga ctcgacagtc accgtaaaat ctgagtcagt cacacacagg ctgtcagcca 6120 
cggcttccac ttgcatggct attctatttt cacacgtgag tttctgttgc tggctggctg 6180 
actggcatta tctatgctaa gttgaaatca ggagtgtgcc cagcagagcc catcattctc 6240 
actgtctttg aaacaaagct gtacggtttg atcgatgaac gtatttaaag catttcatgc 6300 
aatgacaaag tgctcagtag tggaaggcag gctgtgacca gtctgcctgc tccttactat 6360 
aattgtgagg atttgttact ggaacagtac atggaggcct gaccttgtgg gggcacaggg 642 0 
tggaacctta gctgaatata gtgtgtgtct caagaggaag tcagggtact agctcagtgc 6480 
tcaatctcca ggtactatat atacatttgc ccgttttatc tctaatgtga aataaatccc 6540 
caaacacttg tttatcgtgt agcgtaccta aaagactatt ctattatggg tgtccccact 6600 
ttcttggttt ggtcaccccg atcccccggt cttctgctgt atctagaaca gtgactataa 6660 
atgatgtatg ggaatagtgt ttccatatga tctgttgtct ggagtatatg ctacatgttc 6720 
atttactgta caaaaaccca gtgcagctga tgatgcaaag cagtctctct ctgtgtacag 6780 
tgccccacct atttaaaaat cacgtacaan cccagaacac tgtgaaacac ttaacataag 6840 
aaacaaacgc agcgtctgga ttctttccaa ggagagcagc tttctccaca ggaacacagt 6900 
aacaaaagag gtccgccgcc atccacaccc agccaagaca cctcagaggc catagggaca 6960 
acctccttgc tggccaacac ctgctggagc agggcacagg tcccagcaac tgatcctcag 7020 
tggatgggtc cgcagtcaaa gccttaatgg gctctctttt gaaggggaaa gaaanntttc 7080 
aagcttatga tatccaacat tattatagtt gatgagttag taaattccga aaaaaaaaga 7140 
tgattttata tgtatgacat aaaaaaaatc tttgtaaagt gcgcaagtgc aataatttaa 7200 
agaggtctta tctttgcatt tataaattat aaatattgta catgtgtgta atttttcatg 7260 
tattcatttg cagtctttgt atttaaaaaa actttactgt tatgtttgta taatagaaca 7320 
ttaatcattt attataactc agacaaggtg taaataaatt cataattcaa acagccagta 7380 
tatatgcata tatgggtgtt acattgcaaa aatctctatc tttgttctat tcacatgctt 7440 
aaagaagtaa gaaatctttt gtggatatgt aattatacat ataaagtata tatatatgta 7500 
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tgatacatga aatatattta gaaatgttca taattttaat ggatattctt tggtgtgaat 7560 
aattgaatac aacattttta aaatgaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 7618 
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We claim: 



1 . A composition for treating a CAG repeat disorder comprising a compound which 
modulates PDE10A expression and a pharmaceutically acceptable carrier. 

2. A composition as claimed in claim 1, wherein said compound is selected from the group 
consisting of: quinpirole, alloxan, miconazole nitrate, MDL-12330A, and tetracyline 
derivatives such as demeclocycline. 

3. A composition as claimed in claim 1, wherein said disorder is Huntington's disease. 

4. A composition as claimed in claim 1, wherein said compound is selected from the group 
consisting of: 

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-methyl-pyrazino[2', 
1' : 6, 1 Jpyrido [3 ,4-b]indole- 1 ,4-dione, 

(6R,12aR)-2,3,6,7,12,12a-Hexahych-o-6-(5-benzoflu-anyl)-pyrazmo[2',r:6,l]py rido[3,4- 
]indole- 1 ,4-dione, 

(6R,12aR)-2,3,6,7,12,12a-Hexahydro-6-(5-benzofuranyl)-2-isopropyl-pyrazino[ 
2\V:6,l ]pyrido[3 ,4-b]indole- 1 ,4-dione, 

(3S,6R,12aR)-2, 3,6, 7,12, 12a-Hexahydro-6-(5-benzofuranyl)-3-methyl-pyrazino[ 
2',1 ':6, 1 ]pyrido[3,4-b]indole- 1 ,4-dione, 

(3 S,6R, 12aR)-2, 3,6,7, 1 2, 1 2a-Hexahydro-6-(5-benzofuranyl)-2,3 -dimethyl-pyraz 
ino[2',r:6,l]pyrido[3,4-b]indole-l,4-dione. 
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5. A composition as claimed in claim 1, wherein said compound is selected from the group 
consisting of: KS-505, IC224,SCH 51866, IBMX and Dipyridamole. 

6. The use of a composition as claimed in claim 1 for treating a CAG repeat disorder 
comprising administering said composition to a subject in need of such treatment. 

7. The use of a composition of claim 1 for treating Huntington's disease comprising 
administering said composition to a subject in need of such treatment. 

8. A method for identifying a compound which inhibits or promotes a CAG repeat disorder, 
comprising the steps of: 

(a) selecting a control animal having PDE10A and a test animal having PDE10A; 

(b) treating said test animal using a compound; and, 

(c) determining the relative quantity of RNA corresponding to PDE10A, as between said 
animals. 

9. A method of claim 8, wherein said animal is a mammal. 

10. A method of claim 9, wherein said mammal is a mouse. 

1 1 . A method of claim 1 0, wherein said mouse is R6/2 transgenic mouse. 

12. A method of claim 8, wherein said CAG repeat disorder is Huntington's disease. 
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13. A method for identifying a compound which inhibits or promotes a CAG repeat disorder, 
comprising the steps of: 

(a) selecting a host cell containing PDE10A; 

(b) cloning said host cell and separating said clones into a test group and a control group; 

(c) treating said test group using a compound; and 

(c) determining the relative quantity of RNA corresponding to PDE10A, as between said test 
group and said control group. 

14. A method of claim 13, wherein said CAG repeat disorder is Huntington's disease. 

15. A method for detecting the presence of or the predisposition for a CAG repeat disorder, 
said method comprising determining the level of expression of RNA corresponding to 
PDE10A in an individual relative to a predetermined control level of expression, wherein a 
decreased expression of said RNA as compared to said control is indicative of a CAG repeat 
disorder. 

16. A method of claim 15, wherein said CAG repeat disorder is Huntington's disease. 

17. A method of claim 15, wherein said expression is measured by in situ hybridization. 

18. A method of claim 15, wherein said expression is measured using a polymerase chain 
reaction. 

19. A method of claim 15, wherein said expression is measured using a DNA fingerprinting 
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technique. 



□ 

m 
m 
a 
m 
a 
m 

i: 

□ 

ill 
□ 



